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Synopsis 



Introduction 

A central assumption in modern cosmology is that the universe on large scales is homogeneous 
and isotropic [3J . This assumption leads to tremendous simplifications in the application of general 
relativity (GR) to cosmology, since it reduces the ten independent components of the metric of 
spacetime gab{t, x) to essentially a single function of time a{t) known as the scale factor. In the early 
days of modern cosmology, beginning with stalwarts such as Einstein and deSitter, the assumption 
of homogeneity and isotropy was largely motivated on grounds of simplicity and aesthetic appeal. In 
recent times however, it has become possible to confront this assumption with observations, which 
remarkably appears to be justified to a large extent (based on observations of the cosmic microwave 
background (CMB) radiation [5], and on analyses of galaxy surveys 1317]). This indicates that a 
model based on essentially a single function of time might in fact go a long way in furthering our 
understanding of the behaviour of the universe. 

Of course the real universe is not homogeneous; we see a rich variety of structure around us from 
stellar systems to galaxies to clusters of galaxies and even larger structures. The study of the large 
scale structure (LSS) in the universe has a long history going back several decades [10]. Perhaps 
one of the biggest successes of cosmological theory based on GR, has been the explanation of how 
statistical properties of the LSS arise |lT]. The relevant calculations are largely based on linear 
perturbation theory, in which one describes inhomogeneities in the universe as perturbations around 
the smooth solution characterised by the scale factor a{t) and expands the Einstein equations as 
a series in these small perturbations. While such a treatment has met with great success in the 
description of the statistical properties of the tiny fiuctuations (anisotropies) in the temperature of 
the CMB, there are two causes for concern. 

The first is a purely theoretical issue, and is the basis of this work. The idea that the large scale 
universe is homogeneous and isotropic necessarily entails an implicit notion of averaging on these 
large scales. In other words, what one is really saying is, "When the spatially fluctuating parts of 
the solution of GR describing our universe are averaged out, what is left is the homogeneous and 
isotropic solution of Einstein's equations"0. The immediately obvious problem with this statement 

^This solution is known as the Friedmann-Lemaitre- Robertson- Walker (FLRW) solution after those who first 
studied it. 



Ill 



is that the details of the averaging operation are not at all clear, and indeed are usually never 
specified. A bigger problem is one noted by Ellis [T7], and can be stated in the following symbolic 
way. If g denotes the metric, F the Christoffel connection and E[g] the Einstein tensor for the 
metric g, then we have the relations 

Tr^dg ; E[g]r^dT + T\ (1) 

with d denoting spacetime derivatives. The Einstein equations are therefore 

E[g]=T, (2) 

with T denoting the energy-momentum tensor of the matter components. Now, irrespective of any 
details of the averaging operation, one notes that 

E[{g)]-{E[g])r^{rf-{r')^0, (3) 

with the angular brackets denoting the averaging. The FLRW solution would amount to solving 
the equations E[{g)] = (T). In general therefore, it is not true that averaging out the fluctuating 
inhomogeneities leaves behind the FLRW solution, since what we are actually left with is 

E[{g)] = {T)-C ; C~(F2)-(F)^ (4) 

and the homogeneous solution that we are looking for will depend on the details of the correction 
terms C. 

The second cause for concern comes from observations. It has now been established beyond a 
reasonable doubt, that the FLRW metric confronted with observations indicates an accelerating 
scale factor [IS] . Conventional sources of energy such as radiation and nonrelativistic matter cannot 
explain the acceleration, and it is now common to attribute this effect to a hitherto unknown 
"Dark Energy", which in its simplest form is a cosmological constant. The true nature of this 
additional component in the cosmological equations, is perhaps the most challenging puzzle facing 
both theorists and observers today. A huge amount of research has gone into (a) explaining the value 
that a cosmological constant term must take to explain data or (b) assuming a zero cosmological 
constant, constructing models of a dynamical dark energy which explains the observed acceleration 
|19j . It is fair to say however that there is no theoretical consensus on what the origin of Dark 
Energy is. 

Since we have seen above that the effects of averaging lead to some extra, as yet unknown terms 
in the equations, it is natural to ask whether these two issues are connected. Could the acceleration 
of the universe be explained by the effects of averaging inhomogeneities ( "backreaction" ) in the 
universe? Regardless of the answer to this question, what is the nature and magnitude of this 
backreaction? The purpose of this thesis is to answer these questions as rigorously as possible. 
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The conventional wisdom, and loopholes 



One should note that the conventional wisdom on the issue of backreaction in the sense described 
in the previous section, is that this effect can never be significant. It is of importance therefore to 
understand this argument and its shortcomings. The argument goes as follows [27|. One starts by 
assuming that inhomogeneities in the universe can be described in the Newtonian approximation 
of GR, by the gravitational potential ^{t,x) with |<I>| <^ 1, which satisfies the Poisson equation 
= AirGa'^dp where 6p{t,x) is the fluctuation of matter density about the mean homogeneous 
value p{t), and can in general have a large value. (E.g., in clusters of galaxies one finds 6 = Sp/p ~ 
10^, and the ratio increases on smaller length scales). One then argues that the universe we observe 
does seem to be very well-described by the above model, and effects of averaging this model can 
only arise at second order in $ and should hence be extremely small. 

There is a loophole in this argument though. The catch is that the background expansion a(t) 
is defined completely ignoring the backreaction, which is an integrated effect with contributions 
from a large range of length scales. This means that the following possibility cannot be a priori 
ruled out : Initial conditions are specified as a perturbation around a specific FLRW solution, 
but the integrated effect of the backreaction grows (with time) in such a manner as to effectively 
yield a late time solution which is a perturbation around a different FLRW model. Indeed, there 
are calculations in the literature that do indicate such a possibility being realised [28^ \3T\ I34j . 
We therefore see the need to actually perform a rigorous calculation that will describe the time 
evolution of the backreaction, and thereby either confirm or overthrow the conventional wisdom. 

Averaging schemes 

A major hurdle in computing the effects of averaging has been the lack of reliable averaging pro- 
cedures which can be used in GR, mainly because defining and physically interpreting averaging 
operations suitable for tensors, is a challenging prospect. A number of authors have attempted to 
solve this problem, both in the specific context of cosmology and also as a more general problem of 
the mathematics of GR (see, e.g. Refs. [251 [531 [211 ) . To date, the most promising averaging 

schemes have been the spatial averaging of scalars due to Buchert |401 \^T\ , and the fully covariant 
tensor averaging due to Zalaletdinov [13| . 

Buchert's scalar averaging deals with a chosen 3-1-1 splitting of spacetime, and only averages 
two of the Einstein equations. This averaging scheme is simple to implement and intuitively easy to 
grasp, however it is ultimately difficult to interpret its physical significance. Zalaletdinov's scheme 
on the other hand, is technically challenging to handle, since its averaging operation can deal with 
full-fiedged tensors at the cost of introducing some new mathematical structures into the problem. 
The appeal of this scheme lies in the fact that ultimately one has in hand an object which can 
legitimately be called the "averaged metric" on an "averaged manifold" . 

In this thesis we use both these schemes to address certain specific questions concerning the 
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backrcaction problem. As wc discuss below however, ultimately we rely upon Zalaletdinov's scheme 
to make realistic statements regarding the nature of cosmological backreaction. 

Can backreaction ever be leirge? 

A Paranjape and T P Singh, Class. Quant. Grav. 23, 6955 (2006). 

One question to ask in the context of the conventional wisdom presented above, is whether it 
is technically possible within GR to have a situation in which the backreaction dominates the 
averaged expansion. We answer this in the affirmative by studying a toy model. We use the exact, 
spherically symmetric Lemaitre-Tolman-Bondi (LTB) solution of GR, to construct a parametrized 
toy model for a curvature dominated universe, i.e. - a spacetime in which the spatial curvature 
of the 3-dimensional slices dominates over the contribution of the (nonrelativistic) matter. Using 
Buchert's averaging scheme we find that the effective scale factor obtained in this toy model, does 
in fact accelerate for a wide class of parameter values. Now one needs to ask whether this can 
happen in the real universe, for which we turn to Zalaletdinov's approach. 

Simplifying Zalaletdinov's framework 

A Paranjape and T P Singh, Phys. Rev. D76, 044006 (2007). 

As it stands, Zalaletdinov's averaging framework deals with averaging an arbitrary spacetime, and 
due to its generality it is technically challenging and difficult to work with. By restricting its 
application to cosmology and requiring consistency with basic cosmological assumptions, we find 
that we can simplify this framework and bring it to a form which can be readily applied to perform 
calculations. Doing this also clarifies the nature of the backreaction as being a physically relevant 
quantity on the same footing as the scale factor of the FLRW spacetime. An important point is 
that for cosmology one must necessarily consider the spatial averaging limit of Zalaletdinov's 4- 
dimensional spacetime averaging. In this limit we further highlight the similarities and differences 
between Zalaletdinov's and Buchert's approaches, and the fact that the structure of the correction 
terms in both approaches is very similar, being essentially the same as expected from the heuristic 
arguments of Ellis discussed earlier. 

There is a significant difference between the original philosophy of the averaging formalism, 
common to both the Buchert and Zalaletdinov schemes, and the manner in which we employ Za- 
laletdinov's averaging. The original idea as developed by these authors was to construct a framework 
which would independently describe a suitably defined averaged dynamics, with no reference to the 
inhomogeneous spacetime whose average leads to this dynamics. So, for example, Zalaletdinov for- 
mulates a new theory of gravity (named Macroscopic Gravity or MG) which attempts to describe 
the dynamics of an averaged manifold, with no recourse to the underlying manifold which is de- 
scribed by the usual Einstein equations. The backreaction in this approach is actually a new field 
in the problem which satisfies its own equations and whose dynamics must be solved for simulta- 
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ncously with that of other fields such as the averaged metric and the averaged energy-momentum 
tensor for matter. 

Our approach to the backreaction issue is different : We consider it central to be able to self- 
consistently describe both the inhomogeneous geometry as well as its averaged counterpart. We find 
this necessary since modern cosmology crucially relies on observations of inhomogeneities around 
us, and ignoring the evolution of inhomogeneities when solving for the averaged dynamics does not 
appear to be satisfactory. Put another way, when faced with a solution of the averaged dynamics, 
we find it essential to answer the question "which (if any) inhomogeneous solution could lead to this 
averaged homogeneous solution?" All our calculations therefore focus on solving for the averaged 
dynamics of specific inhomogeneities, which we attempt to keep as realistic as possible. 

Backreaction in (linear) perturbation theory 

A Paranjape, Phys. Rev. D78, 063522 (2008). 

Wc apply the simplified version of Zalaletdinov's scheme to the problem of calculating the back- 
reaction in the pcrturbative framework and determining its evolution. We discuss the issue of a 
possible gauge dependence of the backreaction, which is essentially the problem that in the pcrtur- 
bative framework one must be careful to distinguish physical effects from artifacts of choosing a 
specific coordinate system. We show how the backreaction can be calculated in a gauge indepen- 
dent manner, although one is forced to make certain choices concerning the averaging operation 
itself, which are not fixed by Zalaletdinov's formalism. Once the formalism is developed, we are 
left with expressions for the backreaction that are valid whenever the metric of spacetime can be 
described as a perturbation around its FLRW form, regardless of the magnitude of matter density 
fluctuations. 

To deal with the issue of self-consistency, we propose an iterative procedure to compute the 
backreaction. Since order-of-magnitudc estimates of the backreaction indicate that the effect is 
expected to be small in the early universe (around the epoch of last scattering say), wc begin with 
a "zeroth iteration" in which the backreaction is assumed to vanish entirely. This of course is simply 
the setup for the standard treatment of cosmology, in which the evolution of inhomogeneities can 
be numerically evaluated. We do this and consequently obtain a first estimate for the backreaction 
using the formalism developed earlier. We find that the magnitude of the backreaction in this 
first estimate remains negligible (~ 10~^ at present epoch in dimensionless units) compared to 
the background contribution at all times, and its evolution indicates that continuing to further 
iterations would not lead to any instability; the final answer is expected to converge to a form very 
close to the original "zeroth order" choice for the background. 
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The nonlinear regime 



The preceding arguments however are vahd in the linear regime of perturbation theory where matter 
fluctuations are smah. However, the structure of the integrands of the backreaction functions 
indicate that the contribution from length scales where matter fluctuations have become nonlinear, 
should in fact also remain negligible. Yet one would like to see this in an actual nonlinear calculation 
rather than relying on heuristic arguments. Specifically, in the nonlinear regime when matter 
fluctuations are large, one needs needs to address two issues : (a) Is a perturbative expansion in 
the metric still valid? (b) If so, then is the contribution to the backreaction from nonlinear scales 
in fact negligible? 

A toy model for structure formation 

A Paranjape and T P Singh, JCAP03(2008)023; 

It has been claimed in the literature [37] that perturbation theory does not give correct insight 
into the problem of structure formation in the late universe, and that when one studies simple but 
nonperturbative examples of structure formation, the contribution of backreaction to the averaged 
dynamics is in fact large. These claims are clearly contradictory to the conventional wisdom and 
to the arguments presented earlier. We attempt to sort out this debate by studying a toy model of 
structure formation, using the spherically symmetric LTB solution. 

The matter source in the LTB solution is pressureless "dust" , which is sufficient for our purposes 
since we wish to enquire whether a universe dominated by nonrelativistic matter can have a large 
backreaction component in realistic situations. We assume initial conditions to be a perturbation 
around an FLRW model without dark energy. Our model contains an inner spherical overdense ball 
surrounded by an underdense shell, outside which we take the matter density to be homogeneous. 
The overdense ball initially expands but eventually turns around and begins collapsing, mimicking 
for example the infalling region outside clusters of galaxies. This happens because this inner region 
satisfies equations which are identical to those of a "closed" FLRW model in which the universe 
eventually recollapses. Naively one would expect that the underdense region behaves like the 
"open" FLRW models which expand forever, however the situation is more involved. It turns 
out that imposing appropriate matching conditions at the boundary of the over- and underdense 
regions, implies that a section of the underdense region immediately surrounding the overdense 
ball, will in fact eventually collapse. 

This result has interesting consequences. By simply ignoring this part of the underdense shell 
that eventually turns around, we can show using our model that arguments such as in Ref. [37J 
which ignore matching conditions across boundaries can lead to an accelerating effective scale factor. 
However, as soon as this region is correctly accounted for, the acceleration disappears. 

Consistently with this result, we show that one can rewrite the nonperturbative LTB solution 
as a perturbation around the same FLRW model we started with, provided the peculiar velocity of 
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the dust remains nonrelativistico. This is exactly what one expects from standard textbook results 
concerning cosmology in the Newtonian limit of GR. The small parameter governing the linear 
perturbation theory valid in the early universe is the magnitude of the matter density contrast 5, 
while the small parameter relevant for the late time Newtonian limit is the nonrelativistic peculiar 
velocity v. Our calculation is an explicit demonstration in a physically clear and simple setting, of 
how the perturbation theory in 5 becomes a perturbation theory in v. 

Backreaction in the nonlinear regime 

A Paranjape and T P Singh, Phys. Rev. Lett. 101, 181101 (2008). 

As a final step, for completeness we compute the backreaction in our model of structure formation 
described above. The formalism described in the preceding sections can be applied to this model 
since we can bring the metric of this model to the perturbed FLRW form. There are some subtleties 
regarding the numerical calculations, since the coordinates that are natural to the model are not 
natural to the backreaction formalism, but these can be handled in a straightforward manner. 
As expected, we find that the backreaction for such a model of structure formation is in fact 
negligible. The significance of this calculation is that it is the first one in which the backreaction 
has been calculated as a physically meaningful quantity even in the late time nonlinear phase of 
the cosmological evolution. 

Conclusions and Outlook 

The question of whether backreaction from averaging of inhomogeneities can lead to significant 
effects, has generated a heated debate in the literature. There have been confiicting results on issues 
such as the stability of perturbation theory in the presence of these corrections. Our calculations 
using Zalaletdinov's covariant averaging scheme applied to both linear perturbation theory and toy 
models of nonlinear structure formation, form the first systematic demonstration that perturbation 
theory is in fact stable against corrections due to backreaction, and that backreaction cannot 
explain the late time acceleration of the universe. In principle such calculations can be extended 
to numerical simulations of structure formation, however that is beyond the scope of this work. 

Given that cosmological data is rapidly increasing in quantity and improving in quality, it will 
soon become possible to determine cosmological parameters with percent level accuracy [59], and 
even perform tests of fundamental assumptions such as the Copernican principle [60j. In this 
context, it becomes interesting to ask whether the contribution of the backreaction, while very 
small compared to the background, could be tested or used to improve parameter estimation. This 
remains a subject for future work. 



^The peculiar velocity is defined as the difference between the physical velocity and the Hubble flow of the dust 
element. 
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Chapter 1 

Introduction 



Our understanding of the universe has undergone dramatic changes in the last century. Edwin 
Bubble's discovery in 1924 that stars known as Cepheid variables could be found in the Andromeda 
nebula and appeared fainter than Cepheids in the Milky Way, established that such nebulae were 
not part of the Milky Way but were in fact distant galaxies themselves. And his demonstration 5 
years later that these galaxies appear to be receding from the Milky Way at speeds proportional 
to their distances, has found its way into popular consciousness as the maxim "the universe is 
expanding" [1]. On the theoretical front, Einstein's general relativity had at this time quickly gained 
acceptance as a fundamental theory of gravity, and its application to cosmology was being studied 
by several workers including Einstein himself, deSitter, Lemaitre, Friedmann among others. The 
simple models of a universe described by the homogeneous and isotropic geometries characterised 
by the Priedmann-Lemaitre-Robertson- Walker (FLRW) metric, were very successful at describing 
the then limited amount of cosmological observations. The decades since have seen the emergence 
of the highly successful Big Bang model of cosmology, which posits that the universe went through a 
very hot dense phase at early times and cooled as it expanded, with tiny fluctuating inhomogeneities 
in the past that have grown to form structures such as galaxies today [2]. 

A central assumption in this (widely accepted) model of cosmology is that the universe on 
large scales is homogeneous and isotropic [3]. This assumption leads to tremendous simplifications 
in the application of general relativity (GR) to cosmology, since it reduces the ten independent 
components of the metric of spacetime gabit,x) to essentially a single function of time a{t) known 
as the scale factor. In the early days of 20*^^ century cosmology, the assumption of homogeneity 
and isotropy was largely motivated on grounds of simplicity and aesthetic appeal. In recent times 
however, it has become possible to confront this assumption with observations, which remarkably 
appears to be justified to a large extent (based on observations of the CMB radiation [U [5], and 
on analyses of galaxy surveys O [3 [8] , although see Ref. [9] ) . This indicates that a model based 
on essentially a single function of time might in fact go a long way in furthering our understanding 
of the behaviour of the universe. 
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Of course the real universe is not homogeneous; we see a rich variety of structure around us 
from stellar systems to galaxies to clusters of galaxies and even larger structures [10]. Perhaps 
one of the biggest successes of cosmological theory based on GR, has been the explanation of how- 
statistical properties of the large scale structure arise [11]. The relevant calculations are largely 
based on linear perturbation theory (i.e. linearizing Einstein's equations around the smooth FLRW 
solution) which is valid at all length scales of interest at early times and on large scales at late times 
[12\ [T3] . Dynamics on small scales at late times involves nonlinear theory, and is dealt with using 
approximation schemes such as the Press-Schechter formalism and its extensions [H], "Newtonian" 
nonlinear perturbation analyses [15] and numerical simulations [16]. While such treatments have 
met with great success in the description of the statistical properties of the anisotropics in the 
temperature of the CMB, as well as of the inhomogeneous distribution of galaxies, there are two 
causes for concern. 

The first is a purely theoretical issue, and is the basis of this work. The idea that the large scale 
universe is homogeneous and isotropic necessarily entails an implicit notion of averaging on these 
large scales. In other words, what one is really saying is, "When the spatially fluctuating parts 
of the solution of GR describing our universe are averaged out, what is left is the homogeneous 
and isotropic FLRW solution of Einstein's equations" . The immediately obvious problem with this 
statement is that the details of the averaging operation are not at all clear, and indeed are usually 
never specified. A bigger problem is one noted by Ellis [17], and can be stated in the following 
symbolic way. If g denotes the metric, T the Christoffel connection and E[g] the Einstein tensor 
for the metric g, then we have the relations 

Tr^dg ; E[g]r^dT + T\ (1.1) 

with d denoting spacetime derivatives. The Einstein equations are therefore 

E[g]=T, (1.2) 

with T denoting the energy-momentum tensor of the matter components. Now, irrespective of any 
details of the averaging operation, one notes that 

E[{9)]-{E[g])r^{rf-{r')^0, (1.3) 

with the angular brackets denoting the averaging. The FLRW solution would amount to solving 
the equations E[{g)] = (T). In general therefore, it is not true that averaging out the fluctuating 
inhomogeneities leaves behind the FLRW solution, since what we are actually left with is 

E[{g)] = {T)-C ; Cr.{r')-{rf, (1.4) 

and the homogeneous solution that we are looking for will depend on the details of the correction 
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terms C. 

The second cause for concern comes from observations. It has now been estabUshed beyond a 
reasonable doubt, that the FLRW metric confronted with observations indicates an accelerating 
scale factor [18] . Conventional sources of energy such as radiation and nonrelativistic matter cannot 
explain the acceleration, and it is now common to attribute this effect to a hitherto unknown 
"dark energy", which in its simplest form is a cosmological constant. The true nature of this 
additional component in the cosmological equations, is perhaps the most challenging puzzle facing 
both theorists and observers today. A huge amount of research has gone into (a) explaining the value 
that a cosmological constant term must take to explain data or (b) assuming a zero cosmological 
constant, constructing models of a dynamical dark energy which explains the observed acceleration 
|19| . It is fair to say however that there is no theoretical consensus on what the origin of dark energy 
is. Since we have seen above that the effects of averaging lead to some extra, as yet unknown terms in 
the equations, it is natural to ask whether these two issues are connected. Could the acceleration 
of the universe be explained by the effects of averaging inhomogeneities ( "backreaction" ) in the 
universe? Regardless of the answer to this question, what is the nature and magnitude of this 
backreaction? The purpose of this thesis is to answer these questions as rigorously as possible. 

1.1 History of the averaging problem 

The problem of averaging in general relativity has a history going back even further than Ellis' 
work of 1984. In the context of gravitational radiation, the problem of second order effects of 
gravity waves on the large scale background metric of spacetime was studied by Isaacson in the 
1960's |20j in the "short-wavelength" approximation. Isaacson used an averaging operation which 
he called the "BH assumption" after Brill and Hartle [2Tj , which was suited to studying the effects 
of perturbative gravity waves in a spacetime region encompassing many wavelengths. An attempt 
to generalize Isaacson's results was made by Noonan [22], who introduced a different averaging 
procedure which was also constructed for situations where inhomogeneities were perturbative in 
nature. Interest in the cosmological consequences of such an averaging picked up only after Ellis very 
clearly laid down the problems and possibilities that open up when the idea of averaging in general 
relativity is taken seriously. An example is the work of Futamase [23], who introduced a spatial 
averaging procedure after performing a 3 + 1 splitting of spacetime, and computed backreaction 
terms arising from averaging second order perturbations, finding them to be negligibly small (see 
also Ref. [24J). Another example is the work of Boersma [25j, who attempted to construct a 
gauge-invariant (i.e. coordinate independent) averaging procedure in perturbation theory, and also 
estimated that backreaction effects remain negligibly small at the present epoch. (For other work 
on the averaging problem, see Ref. |26j.) 

It may seem intuitively obvious that perturbatively small inhomogeneities can only lead to 
negligibly small backreaction effects. Indeed, this has been the conventional wisdom on this sub- 
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ject, and has recently been spelt out by Ishibashi and Wald [27]. One starts by assuming that 
inhomogeneities in the universe can be described in the Newtonian approximation of GR, by the 
gravitational potential with |$| <C 1, which satisfies the Poisson equation V^<I> = AirGa^Sp 

where 6p{t,x) is the fluctuation of matter density about the mean homogeneous value p{t), and 
can in general have a large value. (E.g., in clusters of galaxies one finds 6 = Sp/p ~ 10^, and the 
ratio increases on smaller length scales). One then argues that the universe we observe does seem 
to be very well-described by the above model, and effects of averaging this model can only arise at 
second order in $ and should hence be extremely small. 

There is a loophole in this argument though. The catch is that the background expansion a(t) 
is defined completely ignoring the backreaction, which is an integrated effect with contributions 
from a large range of length scales. This means that the following possibility cannot be a priori 
ruled out : Initial conditions are specified as a perturbation around a specific FLRW solution, but 
the integrated effect of the backreaction grows (with time) in such a manner as to effectively yield 
a late time solution which is a perturbation around a different FLRW model. Indeed, there are 
calculations in the literature that do indicate that this may happen. For example, Martineau and 
Brandenberger [28] showed in a toy model that long wavelength fluctuations can give rise to a 
backreaction contribution which has a late-time effective equation of state similar to a cosmological 
constant. Their calculations were based on the averaging procedure developed by Abramo et al. [29j 
in the context of backreaction in inflationary cosmology. Other claims to solving the dark energy 
problem using backreaction from long wavelength fluctuations were made by Barausse et al. |30) 
and Kolb et al. [31]. It is fair to say however, that such claims have been controversial. A number of 
authors have argued that when effects of long wavelength fluctuations are suitably "renormalized" 
and the background suitably redefined, the backreaction cannot lead to acceleration of the scale 
factor [32]. Nevertheless, what is definitely true is that the idea of backreaction of cosmological 
fluctuations has generated a lively debate in the community [271 ESI EH EH] . 

In this thesis we will not deal with the effects of long wavelength fluctuations, although we will 
see that certain assumptions need to be made in order to define a self-consistent perturbation theory 
in the presence of an averaging operation. A separate and equally interesting question, which will 
be the main focus of this work, is whether cosmological perturbation theory is stable in the presence 
of the backreaction contribution. There are results in the literature which indicate that this might 
not be the case, and that the backreaction can grow with time in such a manner that at late times 
(when matter fluctuations have become nonlinear) perturbation theory in the metric also no longer 
holds [31] (see also Ref. [36] )• In the same vein, there are arguments using nonperturbative toy 
models of gravitational collapse and nonlinear structure formation, which suggest that perturbation 
theory may not give correct insight into gravitational dynamics at late times in cosmology [37]. If 
these results are relevant for the real world, then it not only means that the conventional wisdom 
is badly failing, but in fact implies that all of late-time cosmology must be reworked from scratch 
(see, e.g. Ref. [55]; also see however Ref. [32]). On the other hand, if these results are for some 
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reason or other not realistic, then it is important ask what is wrong with such arguments, and 
further what the correct approach to the problem is. 

Clearly, to make any headway in this problem, it is first essential to have a reliable averaging 
scheme at hand. Since the questions one is asking deal with the stability of cosmological pertur- 
bation theory, this averaging scheme needs to be inherently nonperturbative, i.e. the validity of 
perturbation theory should not be a prerequisite to defining the averaging prescription. This thesis 
will deal with two averaging schemes present in the literature : the spatial averaging of scalars 
defined by Buchert [40\ I^Tj H2] , and the fully covariant tensor averaging defined by Zalaletdinov 
[4:3\ I45j . Some very interesting early work on possible nonperturbative effects of averaging was 
by Buchert and Ehlers [l6], followed up by Ref. [47j . in the context of spatial averaging in Newto- 
nian cosmology. Buchert's averaging operation in general relativity has since been used by several 
authors to explore the effects of backreaction in various situations [381 HSl HHl IMl EI] , including the 
perturbative contexts mentioned above [MIES], and has also been compared against observations 
[52j . As we shall see later, this averaging scheme has an appealing simplicity of implementation, 
which could be a reason for the amount of attention it has received. In contrast, Zalaletdinov's 
averaging scheme (which was developed earlier than Buchert's work) is technically rather challeng- 
ing to handle and involves a fair amount of complicated algebra. Its strength however lies in the 
fact that it is a fully covariant prescription which, at the end of the day, yields an object which 
can be legitimately called the "averaged metric" on an "averaged manifold". This ultimately al- 
lows us to make physically clear statements regarding the backreaction, which is difficult to do in 
Buchert's scheme as it stands. While Zalaletdinov's scheme has not received the same amount of 
attention as Buchert's, there has been a series of very interesting results derived in this framework 
by Zalaletdinov and coworkers |53j . 

1.1.1 The "Special Observer" assumption 

The idea of using inhomogeneities to explain the dark energy problem has generated a fiurry of 
research in the backreaction problem in recent years, as we saw above (see also Ref. [53]). It is 
important to also mention another approach which has gained popularity in this context, namely 
that of ascribing the dark energy phenomenon to light propagation effects in an inhomogeneous 
universe [55] . The central idea here is that light propagation through an inhomogeneous underden- 
sity or "void" can be significantly different from that in a homogeneous space. In fact, it is possible 
to show that luminosity distance data from supernovae can always be fit by modelling ourselves as 
observers in a void with a suitable density profile. Typically however, the (usually spherical) voids 
invoked for this purpose are very large (in the range of ~ 200/i~iMpc to ~ Ih-^Gpc in diameter), 
and are difficult to reconcile with the typical sizes of voids seen in galaxy surveys, which are in 
the range of 30-50/i-^Mpc, with some "supervoids" reaching ~ lOOh ^Mpc [56]. Nevertheless, this 
idea has been rather vigorously investigated in the last several years. Since this thesis will not 
directly deal with this approach to the dark energy problem, we will simply point the reader to 
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a list of references |57j dealing with the study of light propagation in an inhomogeneous universe 
and of supernovae data and the CMB from the point of view of void-based observers. Unlike the 
backreaction issue which requires mainly theoretical work, a detailed description of the inhomoge- 
neous universe belongs squarely in the regime of observational cosmology (5^. Due to the obvious 
observational difficulties involved in such a program (for example due to the lack of homogeneous 
samples of galaxy data), this approach at present is largely restricted to being an exercise in build- 
ing toy models of the local large scale structure [S^ . As a final comment on this topic, we note 
that this "non-Copernican" approach (even at the level of building toy models) is amenable to 
observational verification or disproof in the coming generation of surveys, as pointed out by Ref. 

m- 

With this introduction, our main results (the thesis of the thesis!) are : 

• Although technically possible, in the real world backreaction does not significantly affect the 
expansion history of the universe. 

• Cosmological perturbation theory is stable against backreaction effects, well into the nonlinear 
regime. 

• Dark energy cannot therefore be an effect of the backreaction of inhomogeneities. 
The outline of this thesis is as follows : 

Chapters 2 and 3 will deal respectively with Buchert's and Zalaletdinov's averaging schemes. 
We will first describe Buchert's scheme in chapter 2 and show using a toy model of a spherically 
symmetric inhomogeneous spacetime (the Lemaitre-Tolman-Bondi or LTB solution [61]), how an 
averaged effective description of the spacetime can have an accelerating scale factor even when 
the underlying exact solution has no exotic elements. This calculation will be based on Paranjape 
and Singh, CQG (2006) (henceforth Paper 1). We will then turn to Zalaletdinov's formalism in 
chapter 3 and give a pedagogical introduction to his 4-dimensional covariant averaging scheme and 
the derivation of the equations in his effective theory of Macroscopic Gravity (MG). We will then 
specialize this formalism for use in cosmology and emphasize the need for a spatial averaging limit 
of this averaging. By explicitly writing out the backreaction terms in a 3 -|- 1-splitting of spacetime, 
we will also be in a position to give a detailed comparison between Zalaletdinov's formalism and 
Buchert's spatial averaging, and to demonstrate the physical relevance of the backreaction terms. 
This part will be based on Paranjape and Singh, PRD (2007) (henceforth Paper 2). 

In chapter 4 we will use the spatial averaging limit of Zalaletdinov's scheme in the setting of 
cosmological perturbation theory. We will show how the leading order contribution to the backre- 
action can be calculated in a gauge invariant manner, and derive expressions for the backreaction 
which are valid whenever the metric of spacetime can be written as a perturbation around the 
FLRW form. We will also discuss the issue of self-consistency of the backreaction calculation, and 
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propose an iterative scheme to calculate the backreaction. As concrete examples we will perform 
the first iteration in such a process, for some well-studied models of perturbative inhomogencities 
in the linear regime. This set of calculations will be based on Paranjape, PRD (2008) (henceforth 
Paper 3). 

Chapter 5 will deal with the regime when matter fluctuations have become large, so that linear 
perturbation theory no longer holds. We will use the LTB solution once more to describe a semi- 
realistic situation of spherical collapse, which we will follow well into the nonlinear regime. Initially 
working in the Buchert formalism, we will emphasize the importance of correctly accounting for 
boundary conditions when building models, by showing how spurious results can be obtained for 
the averaged expansion, by ignoring boundary conditions. We will also show how the late time 
nonperturbative behaviour in our LTB model can be recast in the perturbed FLRW form, by a 
straightforward coordinate transformation, provided matter velocities remain non-relativistic. This 
will demonstrate in a clear and unambiguous manner, how a perturbation theory in the density 
contrast 6 at early times becomes a perturbation theory in peculiar velocity v at late times. These 
results will be based on Paranjape and Singh, JCAP (2008) (henceforth Paper 4). Finally, to 
complete the picture, we will apply the formalism developed in chapter 3, to the toy model described 
above, and explicitly show that the backreaction in the nonlinear regime of structure formation does 
remain small in this model. This calculation will be based on Paranjape and Singh, PRL (2008) 
(henceforth Paper 5). 

We will conclude in chapter 6 with a summary and a brief comparison with other work in the 
literature. Additionally, in the Appendices we have collected some results which will be used in the 
main text. Appendix A describes the homogeneous and isotropic FLRW cosmology and serves to 
fix notation. Appendix B describes the LTB metric which is a solution of the Einstein equations 
sourced by a spherically symmetric pressurcless fluid. Appendix C contains proofs of some results 
quoted in chapter 2. Throughout this work we shall set the speed of light c to unity, and use 
the metric signature ( — h ++). Lowercase Latin indices a,b, ...i, j, ... will take values 0,1,2,3, 
while uppercase indices A, B, ...I, J, ... will take spatial values 1,2,3. The Hubble constant Hq is 
parametrized when necessary in usual astronomers' units as Hq = 100/ikms~^Mpc~^. 
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Chapter 2 



Averaging schemes : Buchert's spatial 
averaging 

This chapter and the next describe the averaging schemes we will use for the backreaction calcula- 
tions. In this chapter we deal with Buchert's spatial averaging scheme and show in a toy example 
how a large, acceleration-inducing backreaction can arise even in the absence of any exotic dark 
energy. 

2.1 Buchert's spatial averaging of scalars 

The most straightforward and intuitively clear application of Buchert's spatial averaging is in the 
case when the matter source is a pressureless "dust" with an energy-momentum tensor T"^ = pu^-u^, 
with u"" the dust 4- velocity which satisfies Uau" = — 1- Assuming further that the dust is irrotational, 
the 4- velocity will be orthogonal to 3-dimensional spatial sections and the metric can be written in 
"synchronous and comoving" coordinates (in which u"" = (1,0)) [62] as, 

= _^^2 ^ hAB{t,x)dx^dx^ . (2.1) 

The expansion tensor is given by 0^ = ( 1/2) /i^*^ /ice where the dot refers to a derivative with 
respect to time t. The traceless symmetric shear tensor is defined as = - (e/3)<5^ where 
e = is the expansion scalar. The Einstein equations can be split into a set of scalar equations 
and a set of vector and traceless tensor equations. The scalar equations are the Hamiltonian 
constraint (|2.2ap and the evolution equation for Q ()2.2bp . 

^^'^n + -Q^-2a^ = l6TTGp, (2.2a) 
3 

(^)7^ + G + = 127rG/9 , (2.2b) 
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where ^^^TZ is the Ricci scalar of the 3-dimensional hypersurface of constant t and o"^ is the rate of 
shear defined by = {l/2)a^a^. Eqns. (j2.2ap and ()2.2bp can be combined to give Raychaudhuri's 
equation 

e + ^Q^ + 2a^ + ATrGp = 0. (2.3) 

The continuity equation p = —Qp which gives the evolution of p, is consistent with Eqns. (|2.2ap . 
()2.2bp . We only consider the scalar equations, since the spatial average of a scalar quantity can 
be defined in a gauge covariant manner within a given foliation of spacetime. For the spacetime 
described by (|2.ip . the spatial average of a scalar ^{t,x) over a comoving domain D at time t is 
defined by 

{q,)jj = A- [ d^xVh^, (2.4) 



where h is the determinant of the 3-metric hj^B and Vd is the volume of the comoving domain given 
by Vj) = (fix^/h. The following commutation relation then holds [iQ] 

{^)v-{^)v = {'^Q)v-{'^)v{Q)v. (2.5) 
which yields for the expansion scalar 

(e)p-(G)^ = (e2)^,-(e)|,. (2.6) 

Introducing the dimensionless scale factor a-o = (Vv/V-Din)^^'^ normalized by the volume of the 
domain D at some initial time tin, we can average the scalar Einstein equations (|2.2ap . ()2.2bp and 
the continuity equation to obtain [40] 



fflo \ ^ 8ttG , , 1.^ 

— =^{p)v--{Qv + {n)v) , 2.7a 
ajy J 3 b 

dx> \ 47rG , , 1 ^ r,, N 

— = ^(p)i?+oQ2?, (2.7b) 

ax> / 3 3 

{p)v = -{Q)v{p)v= -^—{p)v. (2.7c) 

Here (T^)©, the average of the spatial Ricci scalar ('^^'T^, is a domain dependent spatial constant. 
The 'backreaction' Qx) is given by 

Qv^\{{e^)v-{Q)l)-2{a^)v, (2.8) 

and is also a spatial constant. The last equation (j2.7cp simply reflects the fact that the mass 
contained in a comoving domain is constant by construction : since = 5t In the local 
continuity equation p = —Qp can be solved to give p^/h = poVho where the subscript refers 
to some arbitrary reference time Iq. The mass Mj) contained in a comoving domain D is then 
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Mv = J-jj pVhd^x = fjy po\/Tiod^x = constant. Hence 

{P)v = j^ (2.9) 

which is precisely what is imphed by Eqn. (j2.7cp . Equations ()2.7ap . ()2.7bp can be thought of as 
"modified Priedmann equations" (compare Eqns. (jA.Sh ). with the modifications arising due to the 
presence of the backreaction and the fact that the averaged Ricci curvature in general need not 
evolve like ~ as in the FLRW case. 

A necessary condition for ()2.7bp to integrate to (j2.7ap takes the form of the following differential 
equation involving Qx> and ( TZ )x> 

Qv + 6—Qv + {ny^ + 2^{n)v = o, (2.10) 

av av 

which is a very interesting equation because it shows that the evolution of the backreaction is 
intimately tied to that of the average spatial curvature. Scaling solutions for this equation have 
been explored by Buchert, Larena and Alimi [50j, a simple example being {TZ)v oc , Qv oc a^^. 
Clearly the FLRW solution with Qv = is a special case. In this thesis we will mainly be con- 
cerned with the behaviour of the backreaction arising from explicitly averaging an inhomogeneous 
spacetime, and will therefore not discuss these scaling solutions which make no reference to the 
underlying inhomogeneous geometry. We note that the criterion to be met in order for the effective 
scale factor av to accelerate, is 

Qv>4ttG{p)v (2.11) 

Equations (j2.7p and (j2.10p describe the essence of Buchert's averaging formalism, for the simplest 
case of irrotational dust. We note that the remaining eight Einstein equations for the inhomoge- 
neous geometry, which are not scalar equations, are not averaged. These are the five evolution 
equations for the trace-free part of the shear, 

dt {ai) = -Qai -ni + hi {a^ - + 8vrGp) . (2.12) 

and the three equations relating the spatial variation of the shear and the expansion, 

^B\\A = \'^\\B- (2-13) 

Here, IZ^ is the spatial Ricci tensor and a || denotes covariant derivative with respect to the 3- 
metric. Later, when we apply Zalaletdinov's formalism to cosmology, we will see that accounting for 
the full set of Einstein's equations when performing the averaging, leads to additional constraints 
on the inhomogeneous geometry, which are not present in Buchert's formalism. 

For completeness, and to enable a fuller comparison with Zalaletdinov's formalism, we also 
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display the results of applying Buchert's averaging to the case of a perfect fluid with energy- 
momentum tensor T"''' = {p + p)u°'u'' + pg"'^ |41j . In this case comoving coordinates are not in 
general synchronous, and for an irrotational perfect fluid in comoving coordinates, the metric takes 
the form 

ds^ = -N^dt^ + hABdx^dx^ . (2.14) 
The averaged scalar Einstein equations for the scale factor aj) are 

2,^ + A7:G{N^{p+'ip))v = Qv + Vv, (2.15) 
av 

6Hl-lQ7TG{N^p)v = -Qv-{N^n)v ; Hv = ^ , (2.16) 

av 

where the kinematical backreaction Qv is given by 

Qv = l[{ (NQf )v-{Ne )l) - 2{ N^a^ )v , (2.17) 

and the dynamical backreaction Vv is given by 

Vv = {N^A)v + {&dtN)v, (2.18) 

where A = V j{u'^'ViU^) is the 4-divergence of the 4-acceleration of the fluid. Eqn. ()2.16p follows as 
an integral from Eqn. ()2.15p if and only if the relation 

dtQv + GHvQv + dt{N^n )v + 2Hv{N^n )v + ^HvVv 

-lQT:G[dt{N'^p)v + 2>Hv{N'^ {p + p))v] =0, (2.19) 

is satisfled. There are also the unaveraged equations (which we do not display here) for the shear, 
analogous to the shear equations (|2.12p and (j2.13p for dust. 

2.2 Acceleration from averaging 

In this section we use the spherically symmetric dust solution of Einstein's equations (the Lemaitre- 
Tolman-Bondi or LTB solution described in Appendix B) to construct an explicit example where 
Buchert's effective scale factor accelerates even though the underlying model has no cosmological 
constant or dark energy. Our model contains a single underdense and "curvature-dominated" region 
(in a sense to be described below), whose evolution we study at late times. 
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2.2.1 Late time and curvature dominated unbound models 

Consider the LTB metric (jB.ip 

ds"^ = -dt^ + ^rV^dr^ + R^(t, r) (dO^ + sin^ ed())'^) , (2.20) 

1 — k(r)r'^ ^ ' 



which satisfies 



^"^'^^)=^-^^-^^^y^ (2.21) 

for the specific case A;(r) < in the region of interest, so that the solution is ()B.4aP 

GMir) , , , , , GMir) , , , r , , ^ 

R = — , , , „ (cosh 7/ — 1) ; t — to[r) = -7- (smhr/ — rj) , < r] < 00 , for kir) < 0. 

-A;(r)r^ (^_k{r)r^y'^ 

(2.22) 

Although it is straightforward to numerically evaluate R{t, r) for any given choice of the free 
functions, we would like to try and analytically simplify these expressions as far as possible. Since 
R{t, r) in the unbound case is an increasing function of time for arbitrary k{r) < 0, Eqn. (|2.2ip 
shows that the late time solution (after neglecting the 1/R term in the equation) can be expressed 
as 

R ~ rV-A;(r) (t - to(r)) , (2.23) 

[This leading order solution can also be derived from an asymptotic expansion of the solution ()2.22p 
for large t], see below.] The function to(^) can be obtained using the closed form expression for 
t{R) obtained by integrating Eqn. ()2.22p |65j and the scaling Rin{r) = r, as 

to(r)=tin-rQ'^VQ ; F{x) ^^{l + xf'^ - ^smh'^ (x^'^) , (2.24) 



where we have defined 

2GM{r 
-k{r)r 



9{r)^^-^^. (2.25) 



If we further assume that the matter contribution encoded in M(r) is negligible compared to that 
of the spatial curvature as encoded in ^(r), i.e. if \g[r)/r\ <C 1, then the expression for to(^) also 
simplifies at the leading order tc0 

to{r)^ tin- (2.26) 
^/-k{r) 

The leading order unbound LTB solution in this late time, negligible matter limit is then given by 



R{t,r) = r Xr + y^-k{r) {t - XtUn) , (2.27) 



'^We are assuming that g{r)/r remains finite at all r. In particular as r — > this implies k{r — > 0) ~ r**; p < 
(see Eqn. ([B2J). 
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where we have introduced two placeholders and Xt (ultimately set to unity) which will remind us 
that we are working with a late time solution with large t. Note that Eqn. (j2.27p (with \r = \t = 1), 
is the exact solution Eqn. ()2.2ip in the special case M(r) for all r. This actually corresponds 
to Minkowski spacetime, with the corresponding Riemann tensor being exactly zero. The constant 
time 3-spaces are hypersurfaces of negative curvature, with the 3-curvature being determined by 
the function k{r). The 'FLRW limit of this solution is in fact the Milne universe; the solution 
(|2.27p could hence be thought of as the 'LTB' type generalization of the Milne universe. Although 
we will use this form of the solution to draw conclusions regarding acceleration of Buchert's ajy, 
we will later argue that these conclusions are not altered by the presence of a nonzero but small 
amount of matter. Although it may appear at this stage that requiring M — > renders the late time 
approximation redundant, we will see below that additionally imposing the late time approximation 
allows us to write down a fairly straightforward sufficient condition for acceleration of ajy, which 
would not be possible with only the M ^ condition. 

For the metric (j2.20p . the volume of a spherical comoving domain of radius rx> is 

Vv = A7r dr . (2.28) 

Substituting the solution (|2.27p in this expression, we find 

Vd = {t — Xttinflk + Ar(t — XtUnj'^Ikr + X'^{t — Xttin)Ikr^ + A^X^a ; (2.29) 

where we have defined the domain dependent integrals 

rrx, ^r^(_fcr2)' ^ . (-kr^y , 



The sum of the exponents of Xr and Xt in each term in (j2.29p indicates the relative order of that term 
with respect to the leading term. This approach of treating some terms as small compared to 
others is valid since the various integrals which multiply the powers of t, are all finite and non-zero. 
Expanding Vd in powers of At, A^, we find for the effective scale factor, 

^ _ 2 / ^ 2A| / J_ A ^ (2.31) 

where O (3) represents terms involving A™ A" (i.e. containing (1/^"*+")) with m + n > 3. 

We see that the generic late time (i.e. t — > oo) behaviour of the unbound models under 
consideration is dx> 0, and that deviations from zero are small, being a second order effect. 
Whether the approach to a© = is via an accelerating or decelerating phase, depends upon the 
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Figure 2.1: The models described by fc(r) 
against a for specific values of r-p- 
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-1/(1 + r"). (a) The scaled function V jTk- (b) V jTk plotted 



relative magnitudes of the domain integrals involved. A sufficient condition for an unbound model 
with negligible matter to accelerate at late times, is 



As an explicit example, consider a model with k{r) given by 

1 



(2.32) 



k{r) 



< a < 2, 



(2.33) 



in arbitrary units. The condition < a < 2 ensures that the regularity conditions of Appendix 
B are satisfied. The function V /Ik for these models, which controls the magnitude of the late 
time acceleration (see Eqn. (j2.3ip ) is shown in Fig. 12.11 against and a. For clarity, in the 
second panel we have shown V /Ik against a for specific values of rx>- We find that V /Ik is positive 
everywhere in the region shown. To explicitly demonstrate acceleration, we plot the evolution of 
the dimensionless quantity qx> defined by 



qv = 



avav 



a. 



V 



,VvVv 



(2.34) 



for various fixed values of a and rx>, using the full expression for in (j2.29p . The results are 
shown in Fig. 12. 2[ We have used units in which tj„ = 1, and have displayed the evolution for times 
t > WO tin. 

Even though the acceleration condition (j2.32|) is strictly derived for the case M(r) = 0, one can 
easily see that it remains valid at late enough times even in the presence of a small amount of matter. 
Introducing another place holder e to keep track of the smallness of the function g(r) defined in 
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Figure 2.2: Evolution of q-D in the models with k{r) = — 1/(1 + r"), plotted against t/tin for (a) three values 
of a with r-p = 250, and (b) three values of r-p with a = 1. 



Eqn. (j2.25p . one sees that by waiting long enough, the solution for R{t,r) will be approximately 
given by Eqn. (j2.23p . with tQ{r) given by Eqn. (j2.24p . The expression for the volume Vxy will have 
the same form as Eqn. (I2.29[) . but the integrals involved will be different due to the presence of 
terms involving e 7^ 0. A condition similar to (j2.32p . say P(e) > 0, will then be obtained for late 
time acceleration, with different integrals involved in the definition of the functional 'P(e). The 
point to note is that V{e) = V+ terms containing e with V defined as in Eqn. (j2.32p . and for small 
enough e, V{e) will be positive whenever V is positiv^. Hence the acceleration condition is robust 
against adding a small but nonzero amount of matter. 



Chapter summary and discussion: 

This chapter dealt with details of Buchert's scheme for spatially averaging scalar quantities, and the 
effective cosmological equations it leads to. Using exactly solvable toy models of inhomogeneities, 
which were explicitly averaged using this scheme, we saw how an accelerating effective scale factor 
can arise even in situations where there is no exotic matter component. 

However, the fact that Buchert's scheme deals with only two of the ten Einstein equations, 
makes it difficult to relate the effective scale factor a-p with observations. In particular, it is 
not clear whether or not a-p should replace the usual scale factor in the FLRW metric. We see 
this difficulty of interpretation as arising from the inherent non-covariant structure of Buchert's 
averaging scheme. To get around this problem we will study a different averaging scheme in the 
next chapter, namely Zalaletdinov's fully covariant Macroscopic Gravity. This scheme will allow 
us to deal with objects which are structurally similar to Buchert's scale factor and backreaction, 
while being easier to interpret in a physically clear manner. 



^More precisely, we will have 'P(e) ~ V + 0{e\ne). 
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Chapter 3 

Averaging schemes : Zalaletdinov's 
covariant Macroscopic Gravity 



In this chapter we turn to the averaging defined by Zalaletdinov |43j . which is a 4-dimensional 
generaUy covariant procedure. This averaging is used on the Einstein equations and, together 
with some additional assumptions, leads to what Zalaletdinov has called Macroscopic Gravity 
(henceforth MG). After introducing MG, we will describe its spatial averaging limit as discussed in 
Paper 2. 

3.1 A covariant averaging scheme 

The starting point in any covariant averaging scheme has to be the question : "How does one average 
tensors while retaining their transformation properties under coordinate changes?" If the averaging 
operation is to involve an integral over a spacetime region, then clearly only scalar objects can be 
averaged, since they change only trivially under coordinate transformations. To define a scheme 
for general tensors then, it is essential to introduce some additional structure in the formalism. 
The most convenient option is to introduce a bivector W^'{x',x) which transforms as a vector at 
event x' and as a co-vector at event x. In Zalaletdinov's scheme one postulates the existence of 
such a bivector, requires it to have certain desirable properties, and then explicitly constructs an 
object which has all these properties. We will see how this is done in what follows. [Throughout 
this chapter primed indices (e.g. P"') will refer to "primed events" {x') and unprimed indices to 
unprimed events.] 

To begin with, we require that this bivector be idempotent (i.e. "square to itself") 





and have the coincidence limit 



X' 



lim W/(x',x) = 



(3.2) 
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This ensures that {x',x) has the inverse operator }Vj,{x,x') (which is easily seen by taking the 
X ^ x' hmit in Eqn. (j3.ip and using the condition (|3.2p ). The bivector is then used to define the 
"bilocal extension" of a general tensorial object (denoted by an overtilde) : for a vector P°'[x) this 
takes the form 

P'*(x', x) = Wl, (x, [x') , (3.3) 

with the obvious generalisation to higher rank objects. Notice that the bilocal extension as defined 
above transforms like the original tensor at the event x, but as a scalar at the event x' . This allows 
us to define the "average" of P"(x) over a 4-dimensional spacetime region S with a supporting 
point X, as 

P''{x) = {P-)sT = ^jj^x'^P%x',x) ■ Vj, = jj\'^', (3.4) 

the subscript ST standing for 'spacetime'. While this averaged tensor has the correct transformation 
properties at the event x, in order to be a local function of its argument, one needs to ensure that 
P"'{x) has appropriate differential properties. Since the x dependence of this object arises not only 
from the explicit appearance of Wj'(x',x) but also through the dependence of the domain 5] on 
the support point, in order to correctly calculate the derivative of P°-{x) we need to specify how 
neighbouring domains are related to each other. 

This is done as follows : The same bivector W"' is used to specify a Lie dragging of the averaging 
region 5], ensuring that the volumes of the averaging regions constructed at nearby supporting 
points are coordinated in a well defined manner (which motivates the terminology "coordination 
bivector" for W"', which we will follow henceforth). Suppose x" and x" + ^'^AA are the coordinates 
of two support points, where AA is a small change in the parameter along the integral curve of a 
given vector field Symbolically denote the two points as x and x + ^AA. Then the averaging 
region at x + ^AA is defined in terms of the averaging region 5](x) at x, by transporting every point 
x' € S(x) around x along the appropriate integral curve of a new bilocal vector field S*"' defined 
as 5" (x',x) = (x', x)^-^ (x), thereby constructing the averaging region S(x,AA) with support 
point X + ,^AA. 

We can now evaluate the Lie derivative of P"(x) along the vector field by first noting that 
the Lie derivative of the volume l^s(a;) is 

= r(x)(W{^ .,)5T^SW , (3.5) 

where the semicolon denotes a covariant derivative. An easy way to see this is to note that since 
S(x,AA) = |/ I / = x'^' + 5'''(x',x)AA ; x' G S(x)} , (3.6) 



17 



or symbolically y' = x' + S'AA, we will have 

y^^=y^^(l + AAS'^'5,.lny^^) ; A' = (l + AA^.^S'^') , (3.7) 
and the result (13. 5|) follows from writing 



and using S""' = W^'.^''. For the derivative of we need = x + ^^AA). Explicitly we have 

P'^ix + ^AA) = — i / A'v'^ffM^^ly', ^ + ^ AA) , (3.9) 

which after writing y' = x' + SAA with x' G 5](x) and using <\2>.7\) . finally gives 

-^P" = lim -^{P''{x + i/\X)- P^ix)) 
d\ AA^o AA ^ ^ 



= C''(x) [(^;,P'^)sr + {y^i,yP'')sT - {Wi,j,)sTP''\ , (3.10) 

where we have defined the "bilocal partial derivative" 

3b = db + W{df . (3.11) 

Since the vector field is arbitrary, Eqn. (j3.10p gives us an expression for the partial derivative 
dbP"^ (recalling that dP°-/d\ = S^^dbP"" — P^di,^"-). Requiring that partial derivatives of P"(x) 
commute, leads to the following condition 

d^bdc]P'' = {B[bBc]nST + (^^'^^[6W5';fc')5T - (%W5';fc.)5TP'^ = . (3.12) 

Straightforward algebra shows that 

B^bBc]P'' = {B[bW'i^)dyP\ (3.13) 

^[h^%,k' = (^[6^5');^' , (3.14) 

and hence the necessary and sufficient condition for (j3.12|) to hold, is 

^?[,W5' = Wf,;,] + = , (3.15) 

where underlined indices are not antisymmetrized. 

At this stage it is convenient to re-express results in the language of differential forms, since 
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it will make the algebra that follows concise and readable. The original papers by Zalaletdinov 
introduce a full-fledged bilocal exterior calculus, involving (k, r)-forms which are fc-forms at x and 
^-forms at x' . In what follows however, we will almost exclusively only need to deal with (A:,0')- 
forms that are differential forms at a single event x, although they may be bilocal in their functional 
dependence on events x and x'. This is because we will only deal with bilocal extensions of local 
A;-forms, which are defined to be (A;, 0')-forms. For example, the bilocal extension of a 2-form a is 
defined as 

a{x',x) = ^aj,k'{x')W({x',x)W'^'{x',x)dx'' Adx\ (3.16) 
which is a (2, 0')-form. We define the bilocal exterior derivative as 

d = d + d;v' (3-17) 

where d is the usual exterior derivative which "differentiates and antisymmetrizes at x" , and the 
"shifted" exterior derivative "differentiates at x' but antisymmetrizes at x" so that, say for the 
(2,0')-form a, we have 

d'yy5 = ^aabj'ix',x)W{{x',x)dx'' A dx"" A dx** . (3.18) 

For a bilocal function f{x,x') we will have d/ = Baf^x"". In this language, the preceding results 
for differential conditions on the coordination bivector can be generalised to arbitrary tensor- valued 
fc-forms. In particular for a vector-valued /c-form p"(x) whose bilocal extension is the (A;,0')-form 
p'^(x', x), we have 

dr = ((Ip")5T + (div.W A r)sr - (div,W)5T A p'^ , (3.19) 
where we have defined, 

div.W = W{,,dx'' 

= + W^adj' In \/^) dx"^ . (3.20) 

The condition (j3.15p for the averaged object to be a local function of its argument, reduces to 

dW^" = ; W-'" = Wl'dx'' , (3.21) 
Eqn. (j3.19p shows that it is desirable to choose a coordination bivector which satisfies 

diveW = , (3.22) 
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since firstly, this allows us to commute the exterior derivative with the averaging according to 

dp" = (dp")sr, (3.23) 

and secondly, it implies that the volume of the averaging region is held constant during the coor- 
dination (see Eqn. ()3.5p l. and is therefore a free parameter in the formalism. 

Mars and Zalaletdinov [S] show in their Theorems 1, 3 and 4, that firstly, the general solution 
of Eqn. (j3.2ip for an idempotent coordination bivector is given by 

Wfix',x)=fi{x')f-Y{x), (3.24) 

where f^{x)da = fm is any vector basis satisfying the commutation relations 

[^^,{j] = C'ij^k ; C^- = constant , (3.25) 

and secondly, that Eqn. (j3.22p with the coordination bivector given by Eqn. (|3.24p is always 
integrable on a differentiable manifold with a given volume n-form. The proofs given in Ref. [H] 
are very clear, although somewhat lengthy, and we will hence omit them here. Further, these 
authors also show that for the special class of bivectors for which C^j = 0, the vectors {f^} form a 
coordinate basis, with 'proper' coordinate functions (jf^{x) say, so that 

and satisfying Eqn. (13.22P makes this proper coordinate system volume preserving, with g{(p"^) = 
constant. When expressed in terms of such a volume preserving coordinate (VPC) system, the 
coordination bivector takes its most simple form, namely 

>V/(X',X) |p,oper=<5/. (3.27) 

Volume preserving coordinates in fact form a large class in themselves, generalizing the Cartesian 
coordinate system of Minkowski spacetime. For a discussion on the properties of VPCs and the 
associated bivectors Wi , see Sec. 8 of Ref. [33]. The problem of defining an averaging operator 
has now been reduced to the far simpler problem of choosing a specific VPC system which then 
fixes the coordination bivector. We emphasize that the averaging is still fully covariant; choosing 
a coordination bivector is distinct from choosing a coordinate system to perform calculations in. 
This freedom in defining the coordination bivector leads to a lack of uniqueness of the average in 
the formalism as it stands. We will return to this issue when we study the backreaction arising 
from perturbative inhomogeneities in chapter 4. 



20 



3.2 The averaged manifold 



With the averaging operation in place, we can turn to the description of an averaged geometry. 
Let us begin by recalhng some standard results from differential geometry. Given a differentiable 
manifold Ai endowed with a metric gab of Lorentzian signature ( — h ++), the connection 1-forms 
cj'^j, are defined by the action of the exterior derivative on the basis vectors |66j : 

de, = eau;% = e, (F^J dx^ (dx^ de^) = lj% , (3.28) 

where F^^ are the Christoffel symbols and the parentheses represent the inner product with (dx", e^) = 
6'^. We define the "exterior covariant derivative" D^^ associated with the connection as follows 
: for a A;-form p^, 

D^p^ = dpt - u\ A p? + A p^ , (3.29) 

with an obvious generalisation to higher rank objects. The compatibility between the metric and 
the connection on ^A is expressed by the condition 

Da;5ab = ^Qab " Qak^^^b " 9bk<^^ a = ; (3-30) 

with a similar condition for the inverse metric g""^ . The Cartan structure equations are given by 

A dx^ = , (3.31a) 
du:\ + u\^u\ = v\, (3.31b) 

where Eqn. (j3.31ap expresses the symmetry of the connection cj"^^ and r"^^ is the curvature 2-form 
on M. which defines the Riemann curvature tensor via r'^j, = (l/2!)r'^^^^dx^ A Ax'^. Finally, the 
structure equations (j3.3ip and the metric compatibility condition (j3.30p are supplemented by their 
respective integr ability conditions, given by equations (j3.32ap . (|3.32bp and (j3.32cp 

r'^b A dx^ = , (3.32a) 
dr"b - u\ A r"^ + o;"^ A r^^ = , (3.32b) 
5afcr\ + 5fefcr\ = 0. (3.32c) 

Note that Eqn. (j3.32ap corresponds to the cyclic identity r^j^^^j = and Eqn. (|3.32bp to the 
Bianchi identity ^'^^fjj.^] = 0. 

Consider now the bilocal vGrsion of the basis vector Gq^, given by Wq = Wa Gji , which is a scalar 
at X and a vector at x' , and whose inverse is the (0, l')-form W^^^ = W^,dx^ . The construction 
of an averaged manifold begins with the observation that the (l,0')-form defined by 

n\{x,x') = {W-^'',dW,) , (3.33) 
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transforms like a connection under coordinate transformations at x and, equally importantly, as a 
scalar at x' , and further has the coincidence limit 



X' 



hm n\ix,x') = u:%ix). 

r' — ^nf 



(3.34) 



Explicitly, we have 



n\{x,x') = \w'^,{x,x')w{ix,x)W'',\x',x)ryf^,{x') + W}{x,x')3byVi!{x'^ dx'' 
= T'^ci^, x')dx^ , 



(3.35) 



where F^^ is symmetric in (be) due to the condition ()3.2ip or ()3.15p . This explicit form can be 
used to directly check the transformation properties at x and x' , and d"'j^{x,x') can therefore be 
considered as the bilocal extension of the connection a^"j^(x). 

The transformation properties of f2"^ imply that its average Cl^-^^x), 



has the transformation properties of a local connection, and the coincidence limit (j3.34p implies the 
limit limv5,^o ^""bi^) ~ This leads to the key idea of MG, which is that fl"^, is defined as 

the connection 1-form on a new, averaged manifold 7W. As a set, A4 is identical to Ai; however for 
consistency in the definition of averaged quantities, the guiding principle one adopts is that each 
averaging domain S is effectively treated as a single point on M. This ensures, for example, that 
the averaging operation is idempotent [44j. 

The goal now is to average out the bilocal extensions of the structure equations (j3.3ip and 
the compatibility condition (j3.30p and their integrability conditions (j3.32p . and to express them in 
terms of appropriate differential forms defined on M. The bilocal extensions of Eqns. (j3.3ip and 
(|3.30p . are respectively given by 



where, in the last equation, Dq is the bilocal covariant exterior derivative associated with the 
bilocal connection fi"^. The integrability conditions of Eqns. (j3.37p are given by the bilocal 



(3.36) 



ft\ A dx^ = , 

gab = ^Qah - Qak^^b " = , 



(3.37a) 
(3.37b) 
(3.37c) 
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extensions of Eqns. (j3.32p . 

A dx'' = , (3.38a) 

Dn r\ = d7\ - n% A r\ + Q'', A = , (3.38b) 

gakr\ + Qbkr^a = • (3-38c) 

In the above equations, the (2, 0')-form r"^ is the bilocal extension of the curvature 2-form con- 
structed according to the rules set out in Eqns. (j3.16p and (j3.3p . Similarly the (O,0')-form Ijab is 
the bilocal extension of the metric. To proceed with the averaging, a correlation 2-form is defined 

= {n\ A ST - n\ A . (3.39) 

The average of the curvature 2-form r^^^ on A4 is denoted R";^ = ( r"^) st , and the curvature 2-form 
on the averaged manifold M is denoted M";,, 

M"^ = + n^i^ A Cl\ . (3.40) 

Equations (|3.37ap and ()3.37bp then average out to give 

fl"^ A dx^ = , (3.41a) 
M\ = K\-Z''^%. (3.41b) 

The averages of Eqn. (j3.38ap and the identity = give us 

R^'fe A dx^ = ; R^'a = , (3.42) 

and the symmetry of fi"^ and hence of fl"^ give us 

ZVfeAdx'' = ; ZVa = 0. (3.43) 

This ensures that the curvature 2-form M.""^ satisfies the correct algebraic identities 

M"feAdx'' = ; M''„ = 0, (3.44) 

The formalism at this stage becomes somewhat complicated. The reason is that there is no simple 
way of averaging out equations (|3.37cp . (|3.38bp and (j3.38cp . which become 

dffab - {gak^^blST - {9bk^^a)sT = , (3.45a) 

dR"^, - A v\)sT + A ^\)ST = , (3.45b) 
{gak^^hlST + {9bkr^a)sT = , (3.45c) 
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where gab = {gab)sT- What we need are "sphtting rules" for the various products appearing inside 
the averager in these equations. Of these, Eqn. ()3.45bp can be split by noting that the exterior 
covariant derivative of the correlation 2-form becomes 

nnZ%) = -2P Y'^J^.V + 2P {{r\ A fi^sT - A fi^) , (3.46) 

where the symbol P permutes the free indices in, say pairwise according to P(K'^f,*j'^) = 

(l/3!)(K''jj^™ — K*^°(^"^ + K*^"^"^), and any summed indices are ignored, and the correlation 3- form 
is defined as 

Y%\Z = {n\ A A n™„)sr - 3P(Z'^,^ A - n\ A A fi^^ . (3.47) 

Tracing Eqn. (j3.46p on the indices b and i kills the term involving the correlation 3- form due to 
the presence of the permutation symbol, leaving behind 

- A n\)5T + {r\A n%)sT = aCi\ + k\a n% - n^zy, , (3.48) 

which averages out Eqn. ()3.45bp to give the Bianchi identity for the curvature 2-form M'^^, 

Df^M"^ = dM'^^ - n'^j A M\ + n\ A M\ = . (3.49) 

This was achieved at the cost of introducing a new object, the correlation 3-form Y%^j"^, which 
fixes the differential properties of the correlation 2-form Z"^*^. and hence of the 2-form R"^. The 
differential properties of this 3-form are in turn fixed by introducing a correlation 4^-form in an 
analogous manner. Due to the 4-dimensionality of spacetime there are no higher correlation p- 
forms that need to be defined. 

In practice, it is cumbersome to keep track of the correlation 3-form and 4-form, and furthermore 
it is only the correlation 2-form which will appear in the averaged Einstein equations. We will 
ultimately be interested in explicit calculations of the correlation objects in a perturbative setting 
at leading order, and will hence ignore the 3-form and 4-form. Remarkably, it is also possible to 
self-consistently ignore these forms in the nonperturbative setting [43J as follows : We set the 3-form 
and 4-form to zero and impose the conditions 

DnZV, = = DnR^, (3.50) 

with the second equality required since Eqn. (j3.49p holds. It can be shown [l3] that requiring the 
4-form to vanish also imposes the condition 

p(zVdAZV,) =0. (3.51) 
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The integrability condition for DqZ"^*^. = is 

P (r", a ZV, - a R'^,.) = , (3.52) 

which also contains the integrabihty condition for D^R"^ = 0. 

So far we have only managed to average out Eqn. ()3.45bp . To make progress with Eqns. (j3.45ap 
and (j3.45cp we need additional assumptions. Zalaletdinov argues |43j that for a class of slowly 
varying tensor fields (tensor-valued fc-forms) c™;;' on M such as the metric and other covariantly 
constant tensors, and Killing tensors, etc., the following assumptions may be reasonable 

(n%Ac-::-)5T = n",Acr.-, (3.53a) 

{n\ A K^f')sT = A n',)sT Ac-;;- . (3.53b) 

Then Eqn. (j3.45ap and its analogue for g"''' average out to give 

Bagab = ; B^r' = 0- (3-54) 
Further, for a general slowly varying object c™;;', the following identity holds 

{r\AcZ:)sT - n\ A c-;;- - {n\ a cZ:)st + n\ a d^c-;;- 

= -Z'^,7AcC.---- + ZV„Ac-.- + ... , (3.55) 

which follows from differentiating Eqn. (|3.53ap . and which averages out Eqn. ()3.45cp (and its 
analogue for g"''') to give 

gakM\ + gi,kM\ = ; M V' + M V' = ■ (3.56) 

Eqn. (j3.54p allows one to choose Gab = dab, where Gab is the metric on the averaged manifold M.. 
In general however, we have G"'' 7^ g""^, and one defines the tensor = g""^ — G"'' to keep track of 
this difference. However, we shall see later that when the averaged manifold is highly symmetric, 
as in the case of a manifold with homogeneous and isotropic spatial sections which we will consider, 
one finds that ^7"''' = 0. 



3.3 Averaging Einstein's equations 

In the general case, it turns out that Eqn. (j3.55p is all that is needed to average out the Einstein 
equations 

g^'rkb - \5W'n, = -Ktf"^''^ , (3.57) 
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where k = SttGn, ^a(mic) microscopic energy momentum tensor of the matter distribution, 

and the Ricci tensor on M is defined according to the sign convention rat = ''''' ahj • averaging 
leads to the equations 

(3.58) 

where M^b = ^''abj Ricci tensor on M. and we have defined 

Z^b = '^Z'^ik^jb ; "^"^bj ~ Z°'brnjn^^^ ^ • (3.59) 

The averaged equations (j3.58p differ from the usual Einstein equations by the correlation tensor 
which we define as 

Gt = {z\^b - \61Z'%^ r - (u'^'M.b - IstU^'M,,^ . (3.60) 
Hence, denoting the Einstein tensor on M as E'^, and defining the tensor via 

n = (t'fe^""^>5T, (3.61) 

the averaged Einstein equations read 

El = -kTI + Gl . (3.62) 

Since the left hand side of Eqn. (j3.62p is covariantly conserved by construction {E'^.^ = 0), where 
the semicolon denotes covariant differentiation with respect to the connection on M, in general one 
has 

i-^Tt + Gt\^ = 0, (3.63) 

with no condition on and separately. If, however, we assume that D^Z^^j^'^. = 0, it follows 
that 

Ct;a = 0, (3-64) 

which implies that the averaged energy-momentum tensor is also covariantly conserved. In our 
explicit calculations in the perturbative setting in chapter 4, we will see that this condition does 
not hold in general. 

It can also be shown that in 4 dimensions, the 720 a priori independent components of .^'*f,m 
are subject to 680 constraints arising from Eqns. (j3.52p and (j3.5ip . This leaves 40 independent 
components which combine to give the 10 independent components of the correlation tensor C^. 
The conditions in Eqns. (j3.52p and (|3.5ip do not constrain the components of Cg, which follows 
from considering the structure of those equations. 
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3.4 A 3 + 1 spacetime splitting and the spatial averaging limit 



We are now in a position to apply the MG formalism to the problem of cosmology. The main idea we 
wish to emphasize is that in the cosmological context, it is essential to consider a spatial averaging 
limit of the covariant averaging used in MG. The simplest way to see this is to note that the 
homogeneous and isotropic FLRW spacetim^ must be left invariant under the averaging operation, 
and this is only possible if the averaging is tuned to the uniquely defined spatial slices of constant 
curvature in the FLRW spacetime. We will elaborate on this below. Our main motivation here is 
to spell out all the assumptions usually made in the standard approach to cosmology, and ask what 
they imply in the context of the averaging paradigm. It has been suggested that the observationally 
relevant averaging must necessarily be performed on the light cone [67j. This would not preserve 
the symmetries of the FLRW spacetime. Our point of view is that one needs a theoretically self- 
consistent mathematical framework in which to study cosmological expansion, structure formation, 
etc. We will be conservative in our approach and only make the minimum assumptions necessary 
in order to make standard cosmology compatible with the averaging paradigm. 

We start with the assumption that Einstein's equations are to be imposed on length scales 
-^^inhom where stars are pointlike objects, and that there exists a length scale Lflrw such that 
averaging on this length scale yields a geometry which has homogeneous and isotropic spatial 
sections. We expect Lflrw ^ 100/i~^Mpc and we assume Linhom ^ -^^flrw -^^Hubbie where 
-^^Hubbie is the length scale of the observable universe. In other words, we will assume that the 
averaged manifold Ai admits a preferred, hypersurface-orthogonal unit timelike vector field v"", 
which defines 3-dimensional spacelike hypersurfaces of constant curvature, and that is tangent 
to the trajectories of observers who see an isotropic cosmic background radiation. For simplicity 
we will work with the special case where these spatial sections on M are flat. One can then choose 
coordinates {t,x^), A = 1,2,3, on A4 such that the spatial line element takes the form 



where 6ab = 1 for A = B, and otherwise, and we have v"" = (t;*, 0,0,0) so that the spatial 
coordinates are comoving with the preferred observers. The vector field also defines a proper 
time (the cosmic time) r such that dr = v'^da = v^dt- We will further assume that the averaged 
energy-momentum tensor can be written in the form of a perfect fiuid, as 



'spatial 



a^{t)5ABdx'^dx 



.B 



(3.65) 



(3.66) 



where the projection operator vr^ is defined as 



T^b = (^b + ^ 



(3.67) 



Appendix A describes the main features of the FLRW spacetime. 



27 



and satisfies ir^v^ = = ir^Va, ''^c^b ~ ''^b^ P P homogeneous energy density and 

pressure respectively, as measured by observers moving on trajectories (in M) with the tangent 
vector field v"", 

p^Ttv'va ; P^^vr^n- (3.68) 

An important consequence of the above assumptions is that the correlation tensor C^, when ex- 
pressed in terms of the natural coordinates adapted to the spatial sections defined by the vector 
field v"", is homogeneous and depends only on the time coordinate. This is clear when the modified 
Einstein equations (|3.62|) are written in these natural coordinates. 

Using the vector field v"", the (FLRW) Einstein tensor E'^ can be written as 

= jl{x)v''Vb + j2{x)TT^ , 

h{x) ^ Etv% ; j2{x) ^ ^ {ttIei) , (3.69) 

where ji(x) and j2{x) are scalar functions whose form depends upon the coordinates used. The 
remaining components given by n'^E'^Va and the traceless part of vr^vr^S^, vanish identically. Since 
the energy-momentum tensor in Eqn. p.66p also has an identical structure, this structure is 
therefore also imposed on the correlation tensor C^. Namely, only the components C^v^Va and 
the trace vr^C^ are relevant to the dynamics of the averaged metric. The remaining components, 
namely vr^C^'Da and the traceless part of vr* tt^C^, must vanish. This is a condition on the underlying 
inhomogeneous geometry, irrespective of the coordinates used to describe the geometry on either Ai 
or A4 , and is clearly a consequence of demanding that the averaged geometry have the symmetries 
of the FLRW spacetime. 

This leads us to the crucial question of the choice of gauge for the underlying geometry : namely, 
what choice of spatial sections for the inhomogeneous geometry, will lead to the spatial sections of 
the FLRW metric in the comoving coordinates defined in Eqn. (13.65P ? Since the matter distribution 
at scale Linhom need not be pressure-free (or, indeed, even of the perfect fluid form), there is clearly 
no natural choice of gauge available, although locally, a synchronous reference can always be chosen. 
We note that there must be at least one choice of gauge in which the averaged metric has spatial 
sections in the form (j3.65p - this is simply a reflnement of the Cosmological Principle, and of the 
Weyl postulate, according to which the universe is homogeneous and isotropic on large scales, and 
individual galaxies are considered as the "observers" travelling on trajectories with tangent v'^. In 
the averaging approach, it makes more sense to replace "individual galaxies" with the averaging 
domains considered as physically inflnitesimal cells - the "points" of the averaged manifold 7W . This 
is physically reasonable since we know after all, that individual galaxies exhibit peculiar motions, 
undergo mergers and so on. This idea is also more in keeping with the notion that the universe 
is homogeneous and isotropic only on the largest scales, which are much larger than the scale of 
individual galaxies. 

Consider any 3-1-1 spacetime splitting in the form of a lapse function N{t,x'^), a shift vector 
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N {t, X ), and a metric for the 3-geometry hAB{t, x ), so that the hne element on Ai can be written 
as 

(^)ds'^ = - (Ar2 _ NaN^) dt^ + 2NBdx^dt + hAsdx^dx^ , (3.70) 

where Na = HabN^ . At first sight, it might seem reasonable to leave the choice of gauge arbitrary. 
One could then formally consider a coordination bivector given by the Eqns. (|3.24|) and (|3.26|) . 
with X* denoting the coordinates in the chosen gauge and (j)^ the VPCs; and demand for example, 
that the metric ([STTOll (with say iV^ = 0) average out to the FLRW form (with a nonsynchronous 
time coordinate in general). This would imply 

Goo = {goo)sT = -f{t) ; GoA = {9oa)st = ; Gab = {9ab)st = a^{t)5AB ■ (3.71) 

Note that the condition on the bilocal extension gQAix',x) is in general nontrivial even when the 
components goA{x) are chosen to be zero. In the Appendix (jC.ip we show that with the above 
assumptions, for a general lapse function A^, the conditions D^d""^ — (Eqn. (j3.54p ) also allow us 
to choose 

U'j = g'j - G'^ = . (3.72) 

However, it turns out that if we make the assumption that the spatial sections on M leading to the 
spatial metric ()3.65p on are spatial sections in a volume preserving gauge, then the correlation 
terms simplify greatly. This is not surprising since the MG formalism is nicely adapted to the choice 
of volume preserving coordinates. Moreover, as we will see in chapter 4, at least in the perturbative 
context a modified version of this "VP gauge assumption" in fact becomes a necessity in order to 
consistently set up the formalism. We will therefore introduce spatial averaging in MG by making 
the VP gauge assumption, and will then calculate the correlation terms and display the modified 
equations resulting from this choice of gauge. Following that calculation we will also show how the 
correlation terms can be generalized to the case where the gauge in the inhomogeneous metric is 
(formally) left unspecified. 

To begin our first calculation, we perform a coordinate transformation and shift to the gauge 
wherein the new lapse function N is given by A = where h is the determinant of the new 

3-metric, denoted Kab- In general, one will now be left with a non-zero shift vector A"^; however, 
the condition Ny/h = 1 ensures that the coordinates we are now using are volume preserving, 
since the metric determinant is given hy g = —N'^h = — 1 = constant. We denote these volume 
preserving coordinates (VPCs) by (t, x) = {i,x^) = {t,x,y,z), and will assume that the spatial 
coordinates are noncompact. For simplicity, we make the added assumption that A"^ = in the 
inhomogenous geometry [§, so that g^^ = —N"^ = —1/h and g^A = 0. The line element for the 

^We are making the assumption A*'^ = in the volume preserving gauge for algebraic convenience only - in our 
case this assumption cannot be justified by the absence of vorticity. A more detailed (and complicated) analysis 
should retain an arbitrary A^'^ in the inhomogeneous geometry, and make assumptions about its average - such as 
( Na ) = for example. 
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inhomogenous manifold A4 becomes 

(M)ds^ = -Tj^ + hAB{t,^)dx^dx^ . (3.73) 

Note that in this gauge, the average takes on a particularly simple form : for a tensor p^j{x), with 
a spacetime averaging domain given by the "cuboid" S defined by 

5] = {(t',x^') I i-T/2 < V <t + T/2,x^ - L/2 < x^' < x^ + L/2;A = 1,2,3} , (3.74) 

where T and L are averaging time and length scales respectively, the average is given by 

1 rt+T/2 rx+L/2 ry+L/2 rz+L/2 r 

{p])st{1^) = {p))sTit,^) = 7^73 / dt' dx' / dy' / dz' p]{t',x',y',z') . 

1 L-" Jt-.T/2 Jx-L/2 Jy~L/2 Jz-L/2 L 

(3.75) 

We define the "spatial averaging limit" as the limit T ^ (or T <^ ^Hubble) which is interpreted 
as providing a definition of the average on a spatial domain corresponding to a "thin" time slice, 
the averaging operation now being given by 



i'X+L/2 


l-y+L/2 


(■z+L/2 






/ dx' 


/ dy' 


/ dz' 


P)it,x',y',z') 


lx~L/2 


Iy-Ll2 


1 z-L/2 





3) . (3.76) 



(Note the time dependence of the integrand.) Henceforth, averaging will refer to spatial averaging, 
and will be denoted by (...), in contrast to the spacetime averaging considered thus far (denoted by 
(•••)sTifl- The significance of introducing a spatial averaging in this manner is that the construction 
of spatial averaging is not isolated from spacetime averaging, but is a special limiting case of the 
latter and is, in fact, still a fully covariant operation. 

For the volume preserving gauge, the averaging assumption (|3.71|) reduces to 

Gu = {9tt) = {^) = -f\t) ; GAB = {hAB) = -a\t)5AB, (3.77) 

where d and / are some functions of the time coordinate alone. A few remarks are in order on this 
particular choice of assumptions. Apart from the fact that the spacetime averaging operation takes 
on its simplest possible form (I3.75P in this gauge and allows a transparent definition of the spatial 
averaging limit, it can also be shown that the assumptions in Eqn. (|3.77p are sufficient to establish 
the following relations : 

Here the second equality arises from the condition g'^^ = G^^ which can be assumed whenever the 



^The choice of a cube with sides of length L as the spatial averaging domain was arbitrary, and is in fact not 
essential for any of the calculations to follow. In particular, all calculations can be performed with a spatial domain 
of arbitrary shape. We will only use the cube for definiteness and simplicity in displaying equations. 
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averaged metric is of the FLRW form (see Appendix (jC.ip ). The last equality follows on considering 
the conditions (r^^) ~ (FLRW)pa^ obvious notation, (the basic assumption of the MG averaging 
scheme), details of which can be found in the Appendix ()C.2p . Eqn. (j3.78p reduces the line element 
on Ai to the form 

iM)d^2 = -^ + a\t)6ABdx^dx'' . (3.79) 

The line element in Eqn. ()3.79p clearly corresponds to the FLRW metric in a volume preserving 
gauge which differs from the standard synchronous and comoving gauge, only by a redefinition of 
the time coordinate. The vector field introduced at the beginning of this section and which 
defines the FLRW spatial sections, is now given by 

^;" = (a3, 0,0,0) ; = Gabv' = (^-^,0,0,0,^ . (3.80) 

Note that is not in general the average of the vector field = (\//i, 0,0, 0) which defines the 
3 + 1 splitting on Ai, but (at least in the volume preserving gauge) is related to it by 

= (SM) 

(A simple relation such as (|3.8ip cannot in general be written for an arbitrary gauge.) 

As mentioned earlier, the spatial averaging limit of the covariant MG averaging is important 
because we want the homogeneous and isotropic FLRW geometry to average to itself. Since the 
FLRW geometry has a preferred set of spatial sections, one therefore needs to average over these 
sections. Further, since the FLRW metric adapted to its preferred spatial sections depends on the 
time coordinate, it is also essential that the spacetime average involve a time range that is short 
compared to the scale over which say the scale factor changes significantly. Clearly then, averaging 
the FLRW metric (denoted ^^^^^^ gab) given in Eqn. (j3.79p will strictly yield the same metric only 
in the limit T ^ 0. Namely, for the cuboid 5] defined in Eqn. (|3.74p 

( (^^^^^Sa. ) = lim / dM (^^^^)5a.(t', x') = , (3-82) 

which should be clear from the definition of the metric. The result {^^^^^^Qab) = ^^^^^^ dab in 
the spatial averaging limit can also be shown to hold for the FLRW metric in synchronous gauge, 
where the coordination bivector Wj' can be easily computed using the transformation from the 
VPCs {i,x^) to the synchronous coordinates (t, y"^) given by 

^ ; y^ = x^. (3.83) 

The transformation (|3.83p will also later allow us to write the averaged equations in the synchronous 
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gauge for the averaged geometry. 



3.5 The correlation 2-form and the averaged field equations 

3.5.1 Results for the Volume Preserving Gauge 

In any gauge with N"^ = 0, the expansion tensor 0;^ is given by 

ei^^h^'^dthcB. (3.84) 

The notation is the same as used in chapter 2, and the expansion scalar Q, shear tensor and 
shear scalar cj^ are defined as before. Note that 

= ^e^G^ - W ; - e^e^ = ^e^ - 2a' . (3.85) 

ZD 3 

The connection 1-forms uj^j = F^^dx'^ in terms of the expansion tensor, are listed below for an 
arbitrary lapse function : 

Lv^o = dt{\n N)dt + 9a (In N)dx^ ; = 7V^/i^^5c(ln N)dt + NQ^dx^ , 

co\ = dA{lnN)dt + ^eABdx^ ; w^j = iVS^dt + (^^F^^dx^ , (3.86) 

where ^^^F^^-, is the Christoffel symbol built from the 3- metric Hab and its inverse. Specializing 
now to the volume preserving gauge {N = /i~^/^), the bilocal extensions fi*^- of the connection 
1-forms are trivial and are simply given by 

nyx',x) = F}fc(x')dx^ (3.87) 

Since Gab = dah-, the connection 1-forms fj*^- for the averaged manifold M. are constructed using 
the FLRW metric in volume preserving gauge given in Eqn. (j3.79p . and are given by 

n% = -3M ; n^f^ = HS^dx^ , 

n% = a^HSABdx^ ; Cl^g = HS^di, (3.88) 

where we have defined 

\ da , , 

H = --^. (3.89) 

adt ^ ^ 

Using Eqns. (|3.86p with N = /i^^/^, Eqn. (j3.87p and Eqns. (|3.88p . we can now easily construct the 
correlation 2-form Z"^^ defined in Eqn. (j3.39p . For completeness, we will display all the nontrivial 
components although not all of them will be relevant for the final equations. The condition 

= h^^^' has the effect that several of the Christoffel symbols (which can be read off from 
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Eqn. (j3.86p ) become related to each other. For example, we have Fqq = — 9f(ln\//i) = —^qa ~ 
-{1/Vh)@, and so on. Denoting the spatial average of some quantity p*- as simply (p*- ), we have 



fvO 

^ a A 



70 A 

^00 



ryO A 

^ Q B 



ryO 
^ A B 



B 

^ AC 



rA B 



rjA B 
^ C 



fA C 
-' B D 



AO 



{QQaj) + {dAilnVh)djilnVh)) — Su^H^Saj 



dx'^ A dt 

{ VhQAjdKi^n Vh) )dx'' A dx^ , 



(3.90a) 



r/A 



dx"' A dt 



( ^e:^5x(ln Vh) )dx-' A dx^' , (3.90b) 



ryA 

B Q 



( (3)ri^ ) - ( ^e^9j(in\/)I) ) dx-^ a d?+ ( (3)r^^a^(in V7I) )d2;^ a dx^ , 



2(V)^5[^(ln\/ft:)eB]j) dx'^ Adf+ [{h&AjQBK) ^ cl^^H^SajSbk] dx'' Adx^ 



(3.90c) 
(3.90d) 



Zu a _ 
A — 



ryB 

A 



{^dA{\nVh)Q^j) ^ {^h^''dK{\nVh)QA.j) 



dx-' A dt 



[( e^jef ) - a^H^AjSlc] dx' A dx' 



(3.90e) 



ryB 



{^A{\T^^/h) '^^^T'^j) + {QajQc) ~^^H'^Sa.j5c\ da;^ A df + ( V^O^j ^^^rg^ )da;^ A da;^' , 

(3.90f) 

1 



'i{^dKi\r^Vh)h''^^Q^^) 



dx ' A dt 



dx"' A dx^ , 



(3.90g) 



ryB A 

'■^ C 



dx-' A dt 



yh 

dx' A dt + ( (3)r^,^ '^^rg^ )dx'^ A dx^ , 



(3.90h) 
(3.901) 



where we have used the relation ^^•'T^j = ^^(In \//i)- 
It is now straightforward to use the relations in Eqn. (I3.59[) (note the unconventional normaliza- 
tion of the 2-form) to read off the components -Z^"f,m jn ^^'^ hence perform the required summations 
to construct Z'^-jj^. This, together with the fact that = G"'' (see Appendix (jC.ip ). allows us to 
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construct the correlation tensor defined in Eqn. (j3.60p 



Ca / rya ea ym 

h — \ ijb ~ 2^ 



(3.91) 



Now, the components of the Einstein tensor E"^ for the averaged spacetime with metric (|3.79p are 
given by 



E\ = 3a^H^ ; = = S| , 



E 



B 



6xA 
B 



a^6 



2 ( - + 3i7^ 1 + 

a 



(3.92) 



where the pecuhar sphtting of terms in the last equation is for later convenience. Recall that the 
overdot denotes a derivative with respect to the VPC time i, not synchronous time. In terms of 
the coordinate independent objects introduced in Eqn. (j3.69p . we have 



ji{x) = -3a^H' ; i2(x) = 



2 I ^ + 3H^ ] + H"^ 

a 



(3.93) 



From the averaged Einstein equations in (j3.62p we next construct the scalar equations which in the 
standard case would correspond to the Friedmann equation and the Raychaudhuri equation. These 
correspond to the Einstein tensor components, 



Etv'va = ji{x) ; Tr'^Et + Etv'va = 3j2{x) + h{x 



b rna I Tpd—b- 



(3.94) 



and are given by 



Sa'^H^ = {kTI - CI) Vav" = Kp 



2 L 



6a° ( 3 + 

a 



K (p + 3p) + 2 



g(i) + g(2)+5(2) 



(3.95a) 
(3.95b) 



Here Eqn. ()3.95ap is the modified Friedmann equation and Eqn. ()3.95bp the modified Raychaudhuri 
equation (in the volume preserving gauge on M.). We have used Eqn. (j3.68p . with the overbar on 
p and p reminding us that they are expressed in terms of the nonsynchronous time t, and we have 
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defined the correlation terms 



Q(l) 



my 



(3.96a) 



5(1) = l^AB 
a"' 



( ^'^^ic ^'^rg^ ) - ( 5a (In Vh)dB{\n Vh) ; 



(3.96b) 



(3.96c) 



5(2) = a^{^h^^dA{lnVh)dBilnVh)] 



j5^^{dAi\nVh)dBilnVh)[ 



(3.96d) 



We have used the second relation in Eqn. ()3.85p in defining Q^^^ . Q^^^ and Q^^) ^j-g correlations of 
the extrinsic curvature, whereas S^^^ and 5^^) are correlations restricted to the intrinsic 3-geometry 
of the spatial slices of A^. Since the components of are not explicitly constrained by Eqns. ()3.5ip 
and (|332D . we can treat the combinations (1/2)(Q(i) + S^'^^) = -Cg and 2(Q(i) + Q(2) +5(2)) = 
(C^ — Cq) as independent, subject only to the differential constraints (I3.64p which follow if we 
assume D^Z'^j*^. = 0. We will return to these below. 

As discussed earlier, the remaining components of must be set to zero, giving constraints 
on the underlying inhomogeneous geometry. In coordinate independent language, these constraints 
read 



0. (3.97) 
Eqns. (j3.97p reduce to the following for our specific choice of volume preserving coordinates. 



; ci-l6i{Cj) = 0, 



(3.98) 
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or, in full detail, 

uJK 

-— Vhe J aOk {In Vh)) - { h^^ ){Vh@jAdKi}n Vh) ] 
n 



+ 



uJK 



0, (3.99a) 



1 



ejdx (In \^)) -{h^^){^ ejdK (in Vh) 



+ 



, (3.99b) 



{\h^^dc{\nVh)dB{\n Vh)] 



(3.99c) 

where ( /i"'^' ) = = (l/a^)^-^^. Eqns. with the choice 500 = -N"^ = -h'^ show that 

all the terms paired within square brackets in Eqns. p.99p above, as also the correlations Q*-^) and 
5(2) defined in Eqns. UiSMch and (I3.96dl) . are of the form 



(900 ) 



(3.100) 



The assumption in Eqn. (|3.53bp shows that one can write 



h^ 
h 



(3.101) 



An interesting point is that the VPC assumption = /i^^/^ also allows us to assume ( h^^ /h) = 
{h^^){\/h) consistently with the formalism (details in Appendix ()C.2p ). Using Eqn. (j3.78p this 
gives us 

(3.102) 



^ h ' aP^ 



This leads to some remarkable cancellations in Eqns. (|3.99p . and also shows that the correlation 
terms Q^^) and 5^^^ in fact vanish, 

q(2) = = 5(2). ^3^j03) 



36 



Eqns. (j3.99p simplify to give 



B ■ 
AB , 



0, 



BK , 



1 

3' 



(3.104a) 
(3.104b) 

(3.104c) 



These simplifications are solely a consequence of assuming that the inhomogeneous metric in the 
volume preserving gauge averages out to give the FLRW metric in standard form. In general, these 
simplifications will not occur when the standard FLRW metric arises from an arbitrary choice of 
gauge for the inhomogeneous metric. 

In order to come as close as possible to the standard approach in cosmology, we will now 
rewrite the scalar equations (j3.95p (which are the cosmologically relevant ones) after performing 
the transformation given in Eqn. (j3.83|) in order to get the FLRW metric to the form 



-dT^ + a\T)6ABdy'^dy' 



: a(t(r)) 



(3.105) 



Since Eqns. (j3.95p are scalar equations, this transformation only has the effect of re-expressing all 
the terms as functions of the synchronous time r. Although the transformation will change the 
explicit form of the coordination bivector VV°' , this change involves only the time coordinate, and 
in the spatial averaging limit there is no difference between averages computed in the VPCs and 
those computed after the time redefinition. This again emphasizes the importance of the spatial 
averaging limit of spacetime averaging, if we are to succeed operationally in explicitly displaying 
the correlations as corrections to the standard cosmological equations. The correlation terms in 
Eqns. (|3.96p are therefore still interpreted with respect to the volume preserving gauge, but are 
treated as functions of r. For the scale factor on the other hand, we have 



d-^H 



1 da 
a dr 



l(fa 
a dr'^ 



Further writing 
equations (j3.95p become 



p{r)=p{t{r)) ; P{r) = p{t{r)) . 



H 



FLRW 



SttGn 1 
^'--61 



Q(i)+5« 



(3.106) 



(3.107) 



(3.108a) 



l(Pa 
a dr"^ 



4ttG 



^(P + 3P) + ^Q«. 



(3.108b) 
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We emphasize that the quantities Q^^^ and S^^\ defined in Eqns. (|3.96ap and (|3.96bp as correlations 
in the volume preserving gauge, are to be thought of as functions of the synchronous time r, where 
the synchronous time coordinate itself was defined after the spatial averaging. Such an identification 
is justified since we are dealing with scalar combinations of these quantities. Note that Q^^^ and 
S^^^ can be treated independently, apart from the constraints imposed by Eqn. (j3.64p . which we 
turn to next. These conservation conditions can be decomposed into a scalar part and a 3-vector 
part, given respectively by 

v'Ct,^ = ; 4Ct,a = 0. (3.109) 
In the synchronous gauge (|3.105p for the FLRW metric, the scalar equation reads 

(a,Q(i) + Gi^FLRwQ^'^) + (5.5(1) ^ 2//flrw5(i)) = . (3.110) 

We recall that this equation is a consequence of setting the correlation 3-form and the correlation 
4-form to zero, and it relates the evolution of Q^^^ and S^^\ The 3-vector equation (on imposing 
the first set of conditions in Eqn. (j3.97p ) simply gives drC\ = 0, so that C\ = = constant, which 
also implies that C:^ = = constant and hence this equation gives nothing new. (We have used 
the relations Cq = C^, C\ = o?C\ and Cq = {l/a^^)C^ where denotes the nonsynchronous time 
coordinate t.) 

The cosmological equations (|3.108p . along with the constraint equations (|3.104p and (|3.110p 
are the key results of this section. Subject to the acceptance of the volume preserving gauge on 
the underlying manifold M they can in principle be used to study the role of the correction terms 
resulting from spatial averaging. 

3.5.2 Results for an arbitrary gauge choice 

In this subsection, we will display the results obtained on assuming that the metric 

(^)ds2 = -N^{t, x)dt^ + hAB{t, ^)dx^dx^ , (3.111) 

averages out to the FLRW metric in standard form with a nonsynchronous time coordinate t in 
general, to give 

(^)ds2 ^ -f(t)dt^ + a\t)6ABdx^dx^ . (3.112) 

In other words, we are assuming that the relations in Eqn. (j3.7ip hold. Note that the averaging 
operation is no longer trivial, although we are still assuming an averaging on domains corresponding 
to "thin" time slices. We again split the averaged Einstein equations into scalar equations, and 
3-vector and traceless 3-tensor equations. After transforming to the synchronous time coordinate 
r, now defined by 

T = J fit')dt', (3.113) 
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and again defining H = (l/a){da/dt) and -ffpLRW = {l/a){da/dT) with a(T) = a(t(r)), the modified 
Friedmann and Raychaudhuri equations read 
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where the correlation terms are now defined using the relations, 
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We emphasize that averaging here refers to spatial averaging. Also {g"^^) = G'^^ = {1/ 0^)8'^^ , 
and the index refers to the nonsynchronous time t. It is easy to check using Eqn. dnSSD, thatp(^) 
and V^^^ +'P^^-* correspond to correlations of (the bilocal extensions of) the extrinsic curvature with 
itself and with the time derivative of the lapse function. cS*^^-* corresponds to correlations between 
the Christoffel symbols of the 3-geometry, and S^'^^ to correlations of the spatial derivative of the 
lapse function with itself and with the Christoffel symbols of the 3-geometry. Due to the way we 
have defined these correlations, one can also check that when the lapse function satisfies N^/h = 1 
(so that the averaging becomes trivial), we have ^(1) = Q(i), 5(1) = S^^\ and = g = 5(2), 
where Q^^^ and iS^^-* were defined in Eqns. (j3.96p . The 3- vector and traceless 3-tensor equations 
become 
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where the lower case index m in the last equation runs over all spacetime indices 0, 1,2,3, with 
the index referring to the nonsynchronous time t. It is easy to check that Eqns. (j3.116p reduce 
to Eqns. (|3.104p with the choice = The condition C^.^ = has the scalar part, 

(d^pW + GHflkwP^^^) + + 2//flrw5(^)) + 4//flrw (^^'^ + S^^^) = , (3.117) 

while the 3-vector part, as before, gives nothing new and simply states drC\ = 0. 

We can now state the main result of this section as follows : Having assumed that the FLRW 
spatial sections arise as the average of some gauge choice with lapse function N{t,x.), spatial 3- 
metric hAB{t,^) and shift vector set to zero for convenience, we can construct the scalar 
quantities C^v^Va and vr^C^ + C^v^Va which, in coordinates natural to the FLRW metric take the 
form, 

C;^ = ^ [p(') + cSW] ; ^^C;j + = 2 [pW + + 5{2)J ^ (3,118) 

with the various quantities being defined in Eqns. (|3.115p . These scalars modify the usual cosmo- 
logical equations as shown in Eqns. ()3.114p . and are themselves subject to the differential conditions 
(j3.117p . In addition, for consistency of our assumptions with the formalism, the underlying inho- 
mogeneous metric is also subject to the conditions (j3.116p . 

The combinations on the right hand sides of the relations (|3.118p can clearly be treated indepen- 
dently, apart from the conditions (j3.117p . Further, since the correlation 2-form has 40 independent 
components Z^^^'^j^ after imposing all algebraic constraints, and since none of the four quantities 
p(2), 5« and 5(2) are trivially related by these constraints, one can always treat these four 
functions independently of each other, subject only to the constraint in Eqn. (j3.117p . In general 
this constraint may also not be satisfied, and the correlation tensor may not be independently 
covariantly conserved. In chapter 4 we will see the explicit time dependence of these correlation 
objects in the perturbative setting in cosmology. 



3.6 Comparing the approaches of Buchert and Zalaletdinov 

An important motivation in studying the spatial averaging limit of MG was to be able to compare 
its results with those of Buchert's spatial averaging. Buchert's averaging is the only approach apart 
from Zalaletdinov's MG, which is capable of treating inhomogeneities in a nonperturbative manner, 
although it is limited to using only scalar quantities within a chosen 3 + 1 splitting of spacetime. 
Buchert takes the trace of the Einstein equations in the inhomogeneous geometry, and averages 
these inhomogeneous scalar equations. In the context of Zalaletdinov's MG however, we have used 
the existence of the vector field v"" in the FLRW spacetime to construct scalar equations after 
averaging the full Einstein equations. As far as observations are concerned, it has been noted by 
Buchert and Carfora [42], that the spatially averaged matter density {p)ti defined by Buchert is not 
the appropriate observationally relevant quantity - the "observed" matter density (and pressure) 
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is actually defined in a homogeneous space. Since we have done precisely this in Eqn. (j3.68p . we 
are directly dealing with the appropriate observationally relevant quantity in the MG framework. 

Another important difference between the two approaches is the averaging operation itself. 
Buchert's spatial average, given by (j2.4p . is different from the averaging operation we have been 
using (given by Eqn. (j3.76p using the volume preserving gauge), which is a limit of a spacetime 
averaging defined using the coordination bivector W"'. Further, since Buchert only averages two of 
the Einstein equations, a major difference between the two schemes is the presence of the constraints 
(|3.104p on the underlying geometry. These are in general nontrivial and hence indicate that it is 
not sufficient to assume that the metric of the inhomogeneous manifold averages out to the FLRW 
form. 

Most importantly though, Buchert's averaging scheme by itself does not incorporate the concept 
of an averaged manifold M (although the work of Buchert and Carfora [32] does deal with 3-spaces 
of constant curvature). The question then arises as to how one should interpret Buchert's ax>- If one 
does not wish to identify a© with the scale factor in FLRW cosmology, one is compelled to develop 
a whole new set of ideas in order to try and compare theory with observation. On the other hand, if 
one does (naively) identify a© with the scale factor, comparison with standard cosmology becomes 
more convenient, but this introduces a possible inconsistency since we know from Zalaletdinov's 
approach that additional constraints need to be satisfied. It is not clear how one should account for 
these constraints since the non-scalar Einstein equations are not averaged in Buchert's approach. 

It is our understanding that the MG approach is a complete and self-consistent scheme which 
allows us to meaningfully pose questions in the averaging paradigm, which are directly interpretable 
in terms of standard cosmological ideas. The Buchert approach on the other hand, is harder to 
interpret. In the MG approach, there are no unaveraged shear equations, because the trace of 
the Einstein equations has been taken after performing the averaging on the underlying geometry. 
Since the averaged geometry is FLRW, its shear is zero by definition. There is a natural metric 
on the averaged manifold by construction, the FLRW metric. The correlations satisfy additional 
constraints, given by Eqns. (j3.104p . Thus, once a gauge has been chosen and if one can overcome 
the computational complexity of the averaging operation, the cosmological equations derived by us 
in the MG approach are complete and ready for application, without any further caveats. 

In spite of these differences, our equations (j3.108p and (j3.110p for the volume preserving gauge 
are strikingly similar to Buchert's effective FLRW equations and their integrability condition in 
the dust case; and in the case of general A^, the role of Buchert's dynamical backreaction Vv in 
Eqns. (pT5|) and ([239]) is identical to that of our combination of (P^^) + 5(2)) gqns. (|3.114bp 
and (j3.117p . Concentrating on the volume preserving case, the structure of the correlation Q^^^ is 
identical to Buchert's kinematical backreaction Qx) (or Qxi in the general case). The correlation 
S^^^ appears in place of the averaged 3-Ricci scalar {TZ)v in Buchert's dust equations. This is 
not unreasonable since Buchert's {'R.)v can be thought oi as, {TZ)x> = 6fcx>/a|,+ corrections, where 
6kT>/a^ represents the 3-Ricci scalar on the averaged manifold which in our case is zero, and 
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hence 5^^-* represents the corrections due to averaging. Further, these similarities are in spite of 
the fact that our correlations were defined assuming that a volume preserving gauge averages out 
to the FLRW 3-metric in standard form, whereas Buchert's averaging is most naturally adapted 
to beginning with a synchronous gauge. This remarkable feature, at least to our understanding, 
does not seem to have any deeper meaning - it simply seems to arise from the structure of the 
Einstein equations themselves, together with our assumption Dj^Z"^^*^. = 0. In the absence of 
this latter condition, one would have to consider the correlation 3- and 4-forms mentioned earlier, 
and the structure of the correlation terms and their "conservation" equations would be far more 
complicated. In the rest of this thesis we will restrict our calculations to Zalaletdinov's approaclQ. 



Chapter summary and discussion: 

This chapter dealt with Zalaletdinov's fully covariant framework for averaging Einstein's equations, 
named Macroscopic Gravity (MG). Although the details of this approach are rather involved, its 
strength lies in the fact that one can speak in terms of a physically relevant "averaged metric" whose 
behaviour is governed by a set of modified Einstein equations. We argued that in the context of 
cosmology, the standard assumptions regarding the large scale homogeneity and isotropy of the 
universe, translate to the requirement of taking a spatial averaging limit of the four-dimensional 
averaging in MG. The resulting modified cosmological equations were then compared with the 
effective equations derived by Buchert, which we discussed in the previous chapter. 

We wish to emphasize an issue which is of importance in understanding the approach we will 
take in subsequent chapters, which will deal with Zalaletdinov's averaging. There is a significant 
difference between the original philosophy of the averaging formalism, common to both the Buchert 
and Zalaletdinov schemes, and the manner in which we employ Zalaletdinov's averaging. The 
original idea as developed by these authors was to construct a framework which would independently 
describe a suitably defined averaged dynamics, with no reference to the inhomogeneous spacetime 
whose average leads to this dynamics. Zalaletdinov's MG is therefore a new theory of gravity 
describing the dynamics of an averaged manifold, with no recourse to the underlying manifold 
which is described by the usual Einstein equations. The backreaction in this approach is 

''An entirely different outlook towards his approach heis been emphasized to us by Buchert [68) . According to 
Buchert, the absence of an averaged manifold M is not to be thought of as a 'caveat', but as a feature deliberately 
retained 'on purpose'. The actual inhomogeneous universe is regarded by Buchert as the only fundamental entity, 
and the introduction of an averaged universe is in fact regarded as an unphysical and unnecessary approximation. 
As we mentioned earlier, this is probably the most important difference between MG and Buchert's approach. In the 
latter, contact with observations is to be made by constructing averaged quantities, such as the scalars defined earlier 
in this section, and by introducing the expansion factor a-p. The assertion here is that the averaging of geometry, 
as discussed in MG or in the Renormalization Group approach of Buchert and Carfora [12] is not an indispensable 
step in comparing the inhomogeneous universe with actual observations. The need for averaging of geometry is to be 
physically separated from simply looking at effective properties (such as the constructed scalars) which can be defined 
for any inhomogeneous metric. Averaging of geometry becomes relevant if (i) an observer insists on interpreting the 
data in a FLRW template model, so that (s)he needs a mapping from the actual inhomogeneous slice and its average 
properties to the corresponding properties in this template, or (ii) one desires a mock metric, to sort of have a 
thermodynamic effective metric to approximate the real one. 
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actually a new field in the problem which satisfies its own equations and whose dynamics must be 
solved for simultaneously with that of other fields such as the averaged metric and the averaged 
energy-momentum tensor for matter. 

Our approach to the backreaction issue is different : We consider it central to be able to self- 
consistently describe both the inhomogeneous geometry as well as its averaged counterpart. We find 
this necessary since modern cosmology crucially relies on observations of inhomogeneities around 
us, and ignoring the evolution of inhomogeneities when solving for the averaged dynamics does not 
appear to be satisfactory. Put another way, when faced with a solution of the averaged dynamics, 
we find it essential to answer the question "which (if any) inhomogeneous solution could lead to this 
averaged homogeneous solution?" All our subsequent calculations will therefore focus on solving for 
the averaged dynamics of specific inhomogeneities, which we attempt to keep as realistic as possible. 
We will start with an application of the spatial averaging limit of MG, to linear perturbation theory 
in cosmology, in the next chapter. The reader is referred to papers by Zalaletdinov and co-workers 
in Ref. [53], for applications of MG as a stand-alone theory for an averaged manifold. 
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Chapter 4 

Backreaction in linear perturbation 
theory 



In this chapter we wih adapt the MG formahsm and its spatial averaging Umit to the specific case 
of linear cosmological perturbation theory (PT). The motivation behind this excercise is to verify 
the self-consistency of cosmological PT in the presence of a backreaction due to averaging. In 
other words, we wish to ask whether cosmological PT is stable against the inclusion of dynamical 
backreaction terms, or whether a runaway process can render the PT invalid. 

This chapter is organised as follows : In Sec. 14.11 we collect some useful results from linear 
cosmological perturbation theory. Sec. 14.21 presents details of the MG averaging procedure adapted 
to cosmological PT, including general expressions for the leading order backreaction terms, with a 
discussion of gauge related issues and the definition of the averaging operator. The main results 
are in Sec. 14.31 where we derive final expressions for the backreaction, both in real space and 
Fourier space, which can be directly utilised in model calculations. These expressions use a few 
simplifying restrictions which can be lifted if necessary in a completely straightforward manner. 
Sec. 14.41 contains example calculations in first order PT, which show that the magnitude of the 
backreaction is, as expected, negligible compared to the homogeneous energy density of matter in 
the radiation dominated era and for a significant part of the matter dominated era. Throughout 
the chapter, a prime refers to a derivative with respect to conformal time unless stated otherwise, 
and we will assume that the metric of the universe is a perturbation around the FLRW metric 
given by 



Here a is the scale factor and rj is the conformal time coordinate related to cosmic time r by the 
differential relation 



In Eqn. ()4.ip we have allowed the spatial metric to have the general form o^^ab where -^ab is the 





dr = a{rj)dr] 




44 



metric of a 3-space of constant curvature. For the calculations in this chapter we will assume a 
flat FLRW background in coordinates such that ^ab = ^AB] however for future reference we shall 
present certain expressions in terms of the more general spatial metric. Defining 



1 da a' 

n=-—=- 

a at] a 



(4.3) 



and setting / = a in Eqns. (|3.115|) . the correction terms in the modified Friedmann and Raychaud- 
huri equations (|3.114p now read 
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where we have dropped the tildes for convenience. The averaging in Eqns. (j4.4p is assumed to be a 
spatial averaging in an unspecified spatial slicing in the inhomogeneous manifold A4; in Sec. I4.2l we 
will specify the averaging procedure more exactly. In addition, the "cross-correlation" constraints 
(|3.116p with f = a, t = r] and H = 7i must also be satisfied by the inhomogeneities. Note that we 
are not imposing the conservation condition (j3.64p . 

Before we move on to deriving formulae for the correlation terms (|4.4p in terms of perturbation 
functions in the metric, there is one issue which merits discussion. The cosmological perturbation 
setting, together with the paradigm of averaging, presents us with a rather peculiar situation. On 
the one hand, the time evolution of the scale factor is needed in order to solve the equations satisfied 
by the perturbations. Indeed, the standard practice is to fix the time evolution of the background 
once and for all, and to use this in solving for the evolution of the perturbations. On the other 
hand, the evolution of the perturbations (i.e. - the inhomogeneities) is needed to compute the 
correlation terms appearing in Eqns. (|3.114p . Until these terms are known, the behaviour with 
time of the scale factor cannot be determined; and until we know the scale factor as a function of 
time, we cannot solve for the perturbations. Note that this is a generic feature independent of all 
details of the averaging procedure. 

It would appear therefore, that we have reached an impasse. To clear this hurdle, one can try 
the following iterative approach : Symbolically denote the background as a, the inhomogeneities as 
ip, and the correlation objects as C. Note that a, and C all refer to functions of time. We start 
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with a chosen background, say a standard flat FLRW background with radiation, baryons and cold 
dark matter (CDM), and solve for the perturbations in the usual way, without accounting for the 
correlation terms C. In other words, for this "zeroth iteration", we artificially set C to zero and 
obtain a^^^ and f^^^ using the standard approach (see e.g. Ref. [E]). Clearly, since the "true" 
background (say a*) satisfies Eqns. (j3.114p with a nonzero C, we have in general a^^^ ^ a*. Now, 
using the solution (p^^\ we can calculate the zeroth iteration correlation objects C^^'^ by applying 
the prescription to be developed later in this chapter. As a first correction to the solution a^'^\ 
we now solve for a new background a^^\ with the known functions C^^^ acting as sources in Eqns. 
(j3.114p . This first iteration will then yield a solution (f^^^^ for the inhomogeneities, and hence a new 
set of correlation terms C^^^ , and this procedure can be repeatedly applied. Pictorially, 

a(0) ^ ^(0) _^ c^o) _^ ^(1) _^ ^(1) _^ (4^5) 

As for convergence, if perturbation theory is in fact a good approximation to the real universe, 
then one can expect that the correlation terms will tend to be small compared to other background 
objects, and will therefore not affect the background significantly at each iteration, leading to rapid 
convergence. On the other hand, if the correlation terms are large, this procedure may not converge 
and one might expect a breakdown of the perturbative picture itself. We will see that in the linear 
regime of cosmological perturbation theory, the correlation terms do in fact remain negligibly small. 

4.1 Metric perturbations in cosmology 

For ready reference, in this subsection we present expressions for the metric, its inverse, and the 
Christoffel connection in first order cosmological PT, in an arbitrary, unfixed gauge. The notation 
we use is similar to that used in Ref. [63]. We will also give expressions for the first order gauge 
transformations of the perturbation functions (see e.g. Ref. [M])- The first order perturbed FLRW 
metric in an arbitrary gauge and in terms of conformal time r], can be written as 

ds^ = a\r]) [-(1 + 2^)dr]^ + 2ujAdx^dr] + ((1 - 2V')7AB + Xab) dx^dx^] . (4.6) 

The functions (f and ^ are scalars under spatial coordinate transformations. The functions uja and 
XAB can be decomposed as follows 

iOA = dAUJ + CUA ; XAB = DabX + '^'^{AXB)+XAB, (4.7) 

where the parentheses indicate symmetrization; Dab is the tracefree second derivative defined by 

DAB = VAVB-{l/3hABV^ ; V2 = 7^^VaVs, (4.8) 
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with Va the covariant spatial derivative compatible with ^ab] and uja-, Xa and XAB satisfy 

V^(i^ = = VAX^ ; VAxi = = XA, (4.9) 

where spatial indices are raised and lowered using ^ab and its inverse 7"^^. Prom their definitions 
it is clear that (p, ip, u) and x each correspond to one scalar degree of freedom, the transverse 
3-vectors uja and XA each correspond to two functional degrees of freedom, and the transverse 
tracefree 3-tensor XAB corresponds also to two functional degrees of freedom. This totals to 10 
degrees of freedom, of which 4 are coordinate degrees of freedom which can be arbitrarily fixed, 
which is what one means by a gauge choice. For example, the conformal Newtonian or longitudinal 
or Poisson gauge [131 El] is defined by the conditions 

w = o = x ; x^ = o. (4.10) 

For the metric (|4.6p we have at first order, 

V-det5 = a{r]f (1 + 99 - 3^^) • (4.11) 
The inverse of metric (|4.6p . correct to first order, has the components 

= ^ ((1 + 2V')7^^ - X^"") ■ (4.12) 
With Ti. = {a' /a), the first order accurate Christoffel symbols are 
r[Jo = W + (^' ; T'^oA = dAip + nLOA ; T^o = d'^ip + iu'^' + nu'^, 

^AB = {n-^' - 2n{^ + ^)) -fAB - ViA^B) + \x'ab + WXAB , 

T^B = - V) + \ (Vi.^^ - V^i^B) + ^X^' , 

r^c = ^'^f - {^idci' + 6^dB^ - iBcd'^i^) - Huj^^BC + \ (Vcxs + ^ bXc " ^\bc) , 

(4.13) 

where '■'^^f denotes the Christoffel connection associated with the homogeneous 3-metric 7ab- 
4.1.1 Gauge transformations 

While the concept of gauge transformations can be described in a rather sophisticated language 
using pullback operators between manifolds [64|, for our purposes it suffices to implement a gauge 
transformation using the simpler notion of an infinitesimal coordinate transformation (also known 
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as the "passive" point of view). Hence, denoting the coordinates and perturbation functions in the 
new gauge with a tilde (i.e. x'^, cD^, and so on), we have 

x'' = x^ + Cix) ; x" = x"-r, (4.14) 

where the infinitesimal 4-vector can be decomposed as 

C = {e,C^) = {a,d''P + d^) , (4.15) 

where a and /3 are scalars and is a transverse 3-vector satisfying V^d"^ = 0. 

It is then easy to show that if this transformation is assumed to change the metric (14. 6p by 
changing only the perturbation functions but leaving the background intact (a so-called "steady" 
coordinate transformation), then the old perturbations and the new are related by |64J 

if = if + a' + Ha , 
to = uj — a + f3' , 

ti^^ = + d^' , 

X = X + 2P, 

x^ = i^ + d^, 

XAB = XAB ■ (4.16) 

The last equality shows that the transverse tracefree tensor perturbations are gauge invariant. They 
correspond to gravitational waves. 



4.2 The Averaging Operation and Gauge Related Issues 

In this section, we will describe the details of the MG (spatial) averaging procedure adapted to the 
setting of cosmological PT. 

4.2.1 Volume Preserving (VP) Gauges and the Correlation Scalars 

It will greatly simplify the discussion if we start with symbolic calculations which allow us to see 
the broad structure of the objects we are after. Since the correlation objects in Eqns. ()3.114p 
depend only on derivatives of the metric, we will primarily deal with metric fluctuations; matter 
perturbations will only come into play when solving for the actual dynamics of the system. Before 
dealing with the issue of which gauge to choose in order to set the condition (j3.36p with the average 
connection taken to be the FLRW one, we will show that irrespective of this choice, the leading 
order contribution to the correlations requires knowledge of only first order perturbation functions. 
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We will use the following symbolic notation : 

• Inhomogeneous connection: F 

• FLRW connection: Tp 

• Perturbation in the connection : ST = T - Tp = ST^^^ + ^F^^) + . . . 

• Coordination bivector : W = 1 + SW = 1 + 6W^^^ + SW^^'^ + ... 

• Bilocal extension of the connection : F 

• Inhomogeneous part of the bilocal extension of the connection : (5F = F — F^? = ST^ ^ +ST^ ^ + 

• Correlation object : C 

The integer superscripts denote the order of perturbation. The form of the coordination bivector 
arises from the fact that in perturbation theory, in the spatial averaging limit, a transformation 
from an arbitrary gauge to a VP one can be achieved by an infinitesimal coordinate transformation. 
By a VP gauge wc mean a gauge in which the metric determinant is independent of the spatial 
coordinates to the relevant order in PT, but may be a function of time. It can be shown that such 
a function of time (which will typically be some power of the scale factor) , is completely consistent 
with all definitions and requirements of MG in the spatial averaging limit. An easy way of seeing 
this is to note that in any averaged quantity, the metric determinant appears in two integrals, one 
in the numerator and the other in the denominator (which gives the normalising volume). In the 
"thin time slicing" approximation we are using to define the averaging, any overall time dependent 
factor in the metric determinant therefore cancels out. Also, a fully volume preserving coordinate 
system can clearly be obtained from any VP gauge as defined above, by a suitable rescaling of the 
time coordinate. It is not hard to show that in the thin time slicing approximation, this gives the 
same coordination bivector W^'{x',x) as the VP gauge definition above. 

To see that first order perturbations are sufficient to calculate C to leading order, we only have 
to note that the background connection Tp satisfies 

{TF) = rF, (4.17) 

and that the structure of the correlation is C = ( F^ ) — ( F )^, which then leads to 

c = {sr'^) - {srf , (4.18) 

which is exact. Clearly, the correlation is quadratic in the perturbation as expected, and hence to 
leading order, ST above can be replaced by ST^ \ 
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Eqns. (j4.17p and (j4.18p treat the averaging operation at a conceptual level only. To make 
progress however, we also need to prescribe how to practically impose the averaging assumption 



(r) = Tf i.e. (<5r) = 0, (4.19) 

in any given perturbative context. This requires some discussion since, for example, the bilocal 
extension of the connection F has the structure 

f = W-^rW^ + W'^id + Wd')W , (4.20) 

where 5 is a derivative at x and d' a derivative at x' (see Eqn. (j3.35p ). The actual MG averaging 
operation in general is therefore a rather involved procedure. Additionally, it is also necessary to 
address certain gauge related issues. 

To clarify the situation, let us start with a fictitious setting in which the geometry has exactly 
the flat FLRW form, with no physical perturbations. Clearly, if we work in the standard comoving 
coordinates in which the metric 7^5 of Eqn. (|4.ip is simply ^jab = ^AB , then since these coordinates 
are volume preserving in the sense described above, the coordination bivector becomes trivial. The 
averaging involves a simple integration over 3-space, and we can easily see that Eqn. (|4.17p is 
explicitly recovered. 

Now suppose that we perform an infinitesimal coordinate transformation, after imposing Eqn. 
(|4.17p . Since the averaging operation is covariant, then from the point of view of a general coordinate 
transformation, both sides of Eqn. (|4.17p will be affected in the same way. However, suppose that 
we had performed the transformation before imposing Eqn. (j4.17p . In the language of cosmological 
PT, we would then be dealing with some "pure gauge" perturbations around the fixed, spatially 
homogeneous background. If we did not know that these perturbations were pure gauge, we might 
naively construct the nontrivial coordination bivector for this metric, compute the bilocal extension 
of the connection according to Eqn. (j4.20p and try to impose Eqn. (j4.19p . This would be incorrect 
since these perturbations were arbitrarily generated and need not average to zero (for example 
they could be positive definite functions). In order to maintain consistency, it is then necessary to 
ensure in practice that the averaging condition (j4.19p is applied only to gauge invariant fluctuations 
(which is rather obvious in hindsight). 

There is another problem associated with the structure of the coordination bivector, even when 
there are real, gauge invariant inhomogeneities present. Note from Eqns. (|3.24p and (|3.26p that 
the coordination bivector has the structure 



W 



dx 



V 



dxv 



dx 



(4.21) 



where x denotes the coordinates we are working in and xy a set of VPCs. In perturbation theory 
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(in the spatial averaging limit) we will have, at leading order, 

x = xv-C ; xv = x + ^, (4.22) 
where ^ symbolically denotes an infinitesimal 4-vector defining the transformation, and hence 

idxv)/{dx) = l + d^, (4.23) 

and so on, which gives us 

W = l- (50U' + idO\x + . . . = 1 + 5W^^^ + .... (4.24) 

Now when we compute a quantity such as {TpdW^^'^ ) which appears in the expression (j4.20p for 
(r) , we will be left with a fluctuating (x-dependent) term of the form TF{{d^) — 9^), where x 
denotes the 3 spatial coordinates. Hence if we try to impose Eqn. (j4.19p we will ultimately be left 
with equations of the type 

(/)(f)-/(x) = 0, (4.25) 

for some functions derived from the inhomogeneities which we have collectively denoted /. In 
other words, consistency would seem to demand that the inhomogeneities vanish in this coordinate 
system, which is of course not desirable. 

It therefore appears that we are forced to impose Eqn. (|4.19|) in a volume preserving gauge, 
since by definition, only in such a gauge will we have W = 1 exactly. We emphasize that this is 
a purely practical aspect related to defining the averaging operation, and is completely decoupled 
from, e.g. the choice of gauge made when studying the time evolution of perturbations. We are 
in no way breaking the usual notion of gauge invariance by choosing an averaging operator. The 
conditions Eqn. ()4.25p now reduce to the form 

{fvpc){x) = 0, (4.26) 

which are far more natural than Eqn. (j4.25p . The averaging condition is now unambiguous, but 
depends on a choice of the VP gauge which defines the averaging operation, an issue we shall 
discuss in the next subsection. For now, all we can assert is that this VP gauge must be such that 
in the absence of gauge invariant fluctuations, it must reduce to the standard comoving (volume 
preserving) coordinates of the background geometry as in Eqn. (j4.ip . This of course is simply the 
statement that the VP gauge must be well defined and must not contain any residual degrees of 
freedom. 

The averaging operation now takes on an almost trivial form as we have seen in chapter 3 - to 
leading order, for any quantity f{r],x) (with or without indices), the spatial average of / in a VP 
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gauge is given by 

{f){v,x) = ^[ d'yf{ri,y), (4.27) 

Jv{x) 

where the integral is over a spatial domain V(x) with a constant volume Vl. The spatial coordinates 
are the comoving coordinates of the background metric, and at leading order the boundaries of V(x) 
can be specified in a straightforward manner as, e.g., 

V(f ) = {y\ x^- L/2 <y^ <x^ + L/2, A = 1,2, 3.} , (4.28) 

where L is a comoving scale over which the averaging is performed (in which case Vl = L^)- The 
averaging definition can be written more compactly in terms of a window function {x, y) as 

{f){v,x)= I <fyWL{x, y)f{ri, y) ; J d^W^x, y) = 1 , (4.29) 

where WL{x,y) vanishes everywhere except in the region V{x), with the integrals now being over 
all space. This expression will come in handy when working in Fourier space, as we shall do in later 
sections. 

A couple of comments are in order at this stage. Firstly, we have not specified the magnitude 
of the averaging scale L. The general philosophy is that this scale must be large enough that a 
single averaging domain encompasses several realisations of the random inhomogeneous fluctuations, 
and small enough that the observable universe contains a large number of averaging domains. 
However, as we will show later in Sec. 14.31 if one is ultimately interested in quantities which are 
formally averaged over an ensemble of realisations of the universe (as is usually done in interpreting 
observations), then the actual value of the averaging scale becomes irrelevant. 

This brings us to the second issue. The above discussion is valid only in the situation where 
there are no fluctuations at arbitrarily large length scales, since in the presence of such fluctuations 
the averaging condition (j4.19p loses meaning (in such a situation it would be impossible to isolate 
the background from the perturbation by an averaging operation on any finite length scale). Indeed, 
we shall see a manifestation of this restriction in Sec. 14.31 where the correlation scalars will be seen 
to diverge in the presence of a nonzero amplitude at arbitrarily large scales, of the power spectrum 
of metric fluctuations. 

We will end this subsection by explicitly writing out the averaging condition in an "unflxed 
VP" gauge, to be deflned below, and also writing the correlation terms appearing in Eqn. (|3.114|) . 
in this gauge. As we can see from Eqn. ()4.1ip . the basic condition to be satisfied by a VP gauge is 

<f = 3i>. (4.30) 

Hereafter, all VP gauge quantities will be denoted using a tilde. This should not be confused with 
the similar notation that was used so far for the bilocal extension, which will not be needed in the 
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rest of the chapter, and are the scalar potentials appearing in the perturbed FLRW metric 
(j4.6p . The single condition (|4.30p leaves 3 degrees of freedom to be fixed, in order to completely 
specify the VP gauge one is working with. The MG formalism by itself does not prescribe a method 
to choose a particular VPC system; in fact this freedom of choice of VPCs is an inherent part of 
the formalism. We shall return to this issue in the next subsection. For now we define the "unfixed 
VP (uVP) gauge" by the single requirement (j4.30p . with 3 unfixed degrees of freedom, and present 
the expressions for the averaging condition and the correlation scalars, with this choice. 

It is straightforward to determine the consequences of requiring Eqn. (|3.36p to hold, with the 
right hand side corresponding to the FLRW connection in conformal coordinates, and remembering 
that the coordination bivector (in the spatial averaging limit) is now simply a Kronecker delta. 
Together with some additional reasonable requirements, namely 

(V^s) = = (V^a^s), (4.31) 

for any scalar s{r],x), the averaging condition in the uVP gauge reduces to 

(^) = ; {dAi')=0 = {i''), 
{^a) = = {oj'a) ; {x'ab) = 0, 
(VcXb) + (VbXc)-(V^Xbc) = 0, 

( Va^b ) = ( VblDa )=n{xAB), (4.32) 

where we have used the expressions in Eqn. ()4.13p with the uVP condition ()4.30p . We will also 
make the additional reasonable requirement that 

{xab) = 0, (4.33) 

using which it is easy to see that the perturbed FLRW metric (j4.6p and its inverse (j4.12p . in the 
uVP gauge, both on averaging reduce to their respective homogeneous counterparts, namely 

(5a.)=5i™^ ; {9''')=9fFLRW)- (4-34) 
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Using these results, the expressions (j4.4p simplify to give, in the uVP gauge. 



p(i) = _ 



6((V;')') + (V[a^b]V[^cD^1)-^(x'abX 



ABf ' 



(4.35a) 



5(1) = 4 



2( Oa^'VbX^'' ) + ^( V^x^^ {2VAXBC - VbXac) ] 



(4.35b) 



p(l)+p(2)^ 1 



5(2) = 4 



A~i 



lix'AB^''^'': 



■^{x'ab {x 



(4.35c) 
(4.35d) 



where square brackets denote antisymmetrization. 
4.2.2 Choice of VP Gauge 

In this subsection we will prescribe a choice for the VP gauge which defines the averaging operation. 
In general, the class of volume preserving coordinate systems for any spacetime, is very large (see 
Ref. [44j for a detailed characterisation). We have so far managed to pare it down by requiring 
that the VP gauge we choose should reduce to the standard FLRW coordinates in the absence of 
fluctuations. It turns out to be somewhat difficult to go beyond this step, since there does not 
appear to be any unambiguously clear guiding principle governing this choice. We will therefore 
motivate a choice for the VP gauge based on certain details of cosmological PT which one knows 
from the standard treatments of the subject. 

In particular, we shall make use of certain nice properties of the conformal Newtonian or longi- 
tudinal or Poisson gauge, which is defined by the conditions ()4.10p [64j (henceforth we shall refer to 
this gauge as the cN gauge for short). Since this gauge is well defined and has no residual degrees 
of freedom, all the nonzero perturbation functions in the cN gauge, namely 99, il), Cja and xab in 
the notation of Appendix C, are equal to gauge invariant objects. This is trivially true for xab^ 
as seen in the last equation in (|4.16p . For the rest, note that in any arbitrary unfixed gauge, the 
following combinations are gauge invariant at first order 



^B = V+ -Or, 

a 



a uj 



2^ 



Va = u:a - Xa, 



1 / 



1. 



(4.36) 
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which can be easily checked using Eqns. (j4.16p . and in the cN gauge, w, x and XA ah vanish. Here 
$B and are the Bardeen potentials [69l (upto a sign), and ^'^ in particular has the physical 
interpretation of giving the gauge invariant curvature perturbation, which is the quantity on which 
initial conditions are imposed post inflation |70j . 

Additionally, it is also known that the cN gauge for the metric remains stable even during 
structure formation, when matter inhomogeneities have become completely nonlineai0. We believe 
that this is a strong argument in favour of using the cN gauge to define a VP gauge which will then 
define the averaging operation in the perturbative context. This will ensure that this "truncated" 
averaging operation, defined for first order PT, will remain valid at leading order even during the 
nonlinear epochs of structure formation. 

To implement this in practice, consider a transformation from the cN gauge to the uVP gauge 
defined by Eqn. (j4.30p . The transformation equations (j4.16p reduce to 

a' + ma + V^/? = V3 - 3V' , 

= -if — a' — Ha , 

Co = a — P' , 

= (i^ - d^' , 
X = -2/?, 

^^ = -d^ 

XAB = XAB ■ (4.37) 

Recall that to completely specify a VP gauge, we need to fix 3 degrees of freedom in the uVP gauge. 
Our requirement regarding the "well defined"-ness of the VP gauge, forces us to set d'^ = 0, and 
to choose a and /? such that they vanish in the case where if = = ip. 

This has fixed 2 degrees of freedom, in addition to the condition (j4.30p which is just the definition 
of the uVP gauge, and has hence not yielded a uniquely specified VP gauge. To do this, we shall 
make the following additional requirement. Since we are dealing with a spatial averaging, it seems 
reasonable to require that the VP gauge being used to define the averaging, should be "as close 
as possible" to the cN gauge in terms of time slicing, and for this reason we shall set the function 
a to zero. To summarize, the VP gauge chosen is defined in terms of the gauge transformation 
functions = (a, d^P + d'^) between the cN gauge and the VP gauge, by the following relations 

a = = , (4.38) 

^See Ref. [27) for an intuitive description of why this is so. We will also see an explicit demonstration in a toy 
model of structure formation in chapter 5. 
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and 



= 3V' = , (4.39a) 

V2/3 = 99 - SV' , (4.39b) 

Co = -13' ; X = -2/3, (4.39c) 

^A = 0, (4.39d) 

= ; Xab = XAB , (4.39e) 



where the function (3 is restricted not to contain any nontrivial solution of the homogeneous 
(Laplace) equation V^/? = 0. 

Having made this choice for the VP gauge, we are now assured that all averaged quantities 
which we compute are gauge invariant : our choice ensures that the averaging procedure does 
not introduce any pure gauge modes, and the philosophy of "steady" coordinate transformations 
ensures that all background objects are, by assumption, unaffected by gauge transformations. In 
particular, the correlation objects in Eqns. ()4.4p are all gauge invariant^]. This is different from the 
gauge invariance conditions derived in Ref. [29], where the background was also taken to change 
under gauge transformations at second order in the perturbations. It is at present not clear how 
these results are related to ours. 



4.3 The Correlation Scalars 

With the VP gauge choice defined by Eqns. (j4.39p . it is straightforward to rewrite the correlation 
objects in Eqns. (j4.35p (which are in the uVP gauge) in terms of the perturbation functions in the 
cN gauge. We will restrict the subsequent calculations in this chapter to the case where there are 
no transverse vector perturbations, i.e., 

WA = , (4.40) 

in the cN gauge. This is a reasonable choice since such vector perturbations, even if they are excited 
in the initial conditions, decay rapidly and do not source the other perturbations at first order [12j. 
In addition, for simplicity we will choose to ignore the gauge invariant tensor perturbations as well, 

XAB = Q. (4.41) 

■^Note that all these arguments are valid at first order in PT, which is sufficient for our present purposes. A 
consistent treatment at second order would require more work, although as long as one is interested only in the 
leading order effect, these arguments are expected to go through. 
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In terms of the scalar perturbations in the cN gauge, for a flat FLRW background, the correlation 
objects ()4.35p reduce to 



2( {i^'f ) + ( - V') ) - ( (VaVb/?0 (V^V V) > 



(4.42a) 



5(1) 



6( OaV'S^V' > + ( dA{^ - V')5^(¥' - V-) > - ( (VaVbVc/3)(V^V^V^/3) 



(4.42b) 



p(i)+p(2) = i, 



( ^' (y,' - > - 2W { ( 9^ V > - ( V ) + ( V'' (9^ - V') ) 

+( V(v^' - V'O ) + ( (VaVb/3)(V^V^/?') ) } 
(9V (a^^ - ?^9a/3') 



(4.42c) 
(4.42d) 



where [5 is defined in Eqn. (|4.39bp . 

Since we are working with a flat FLRW background, it becomes convenient to transform our 
expressions in terms of Fourier space variables. This will also highlight the problem with large 
scale fluctuations which was mentioned in Sec. 14. 2i We will use the following Fourier transform 
conventions : For any scalar function f{i],x), its Fourier transform f^{i]) satisfies 



ik-x 



(4.43) 



Consider an average of a generic quadratic product of two scalars f^^\x) and f^'^\x) where we 
have suppressed the time dependence since it simply goes along for a ride. Using the definition 
(j4.29p . and keeping in mind that the scalars are real, it is easy to show that we have 



(/«/(2))(x) 



(27r)6 



(4.44) 



where WL{k,x) is the Fourier transform of the window function WL{x,y) on the variable y, and 
the asterisk denotes a complex conjugate. 

In the present context, the functions f^^^ and /^^^ will typically be derived in terms of the 
initial random fluctuations in the metric (p^. which are assumed to be drawn from a statistically 
homogeneous and isotropic Gaussian distribution with some given power spectrum. In order to 
ultimately make contact with observations, it seems necessary to perform a formal ensemble average 
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over all possible realisations of this initial distribution of fluctuations. The statistical homogeneity 
and isotropy of the initial distribution implies that the functions /^^^ and /^^^ will satisfy a relation 
of the type 

[/£V|^*]e„. = {27rf6^'Hh - k2)Pf.f,{\ki\) , (4.45) 

for some function Pf^f^{k^rj) which is derivable in terms of the initial power spectrum of metric 
fluctuations, and where [...]ens denotes an ensemble average and 5^^\k) is the Dirac delta distri- 
bution. 

Applying an ensemble average to Eqn. (|4.44p introduces a Dirac delta which forces ki = k2. 
Further, the normalisation condition on the window function in Eqn. (j4.29p implies that we have 

WL{k = 0,x) = 1, (4.46) 

which means that all dependence on the averaging scale and domain drops out, and we are left 
with 

[(/^^V(^))U = / ^/./.(^)- (4-47) 

Note however, that the right hand side of Eqn. (j4.47p is precisely what we would have obtained, 
had we treated the spatial average (...) to be the ensemble average [ ... \ens to begin with. Therefore 
for all practical purposes, we are justified in replacing all the spatial averages in the expressions for 
the correlation scalars (j4.42p . by ensemble averages. 

It is convenient to define the transfer function ^kiv) '^i^ the relation 

iPf:{7j) = iPf:,^k{v) ■ (4.48) 

For the calculations in this chapter, we shall assume that the cN gauge scalars f{ri, x) and ip{r], x) 
are equal 

ip{r],x) = Tp(r],x) , (4.49) 

a choice which is valid in first order PT when anisotropic stresses are negligible (see Ref. [12j). 
This simplifies many of the expressions we are dealing with. The Fourier transform of /? can be 
written, using Eqns. (|4.39bp and (j4.49p . as 

W = ^^dv) ■ (4.50) 

Finally, in terms of the transfer function ^kiij) the initial power spectrum of metric fluctuations 
defined by 

[^k\^^*kJens = i27r)'S(^\k, - k2)P^,{k,) , (4.51) 
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the correlation scalars (j4.42p can be written as 



p(i) 
5(1) 
70(1) +70(2) 
5(2) 

These expressions highhght the problem of having a finite amplitude for fluctuations at arbitrarily 
large length scales (fc — >■ 0), which was mentioned in Sec. I4.2i As a concrete example, consider the 
frequently discussed Harrison-Zel'dovich scale invariant spectrum [71j which satisfies the condition 

k^Pipiik) = constant . (4.53) 

Eqns. ()4.52p now show that if the transfer function ^kiv) has a finite time derivative at large scales 
(as it does in the standard scenarios - see the next section), then the correlation objects V^^\ V^"^^ 
and S^'^^ all diverge due to contributions from the k ^ regime. This demonstrates the importance 
of having an initial power spectrum in which the amplitude dies down sufficiently rapidly on large 
length scales (which is a known issue, see Ref. [TOj). Keeping this in mind, we shall concentrate 
on initial power spectra which display a long wavelength cutoff. Models of inflation leading to 
such power spectra have been discussed in the literature [72], and more encouragingly, analyses of 
WMAP data seem to indicate that such a cutoff in the initial power spectrum is in fact realised in 
the universe [73] . 

A final comment before proceeding to explicit calculations : In addition to picking up nontrivial 
correlation corrections in the cosmological equations, the averaging formalism also requires that the 
"cross-correlation" constraints in Eqns. (|3.116p be satisfied. It is straightforward to show that the 
statistical homogeneity and isotropy of the metric fluctuations implies that these constraints are 
identically satisfled, for all types of perturbations (scalar, vector and tensor). At the lowest order 
therefore, these constraints do not impose any additional conditions on the perturbation theory, 
which is reassuring. 



= -|/ ^,k'P,.ik)i<^',)\ (4.52a) 
= -| / ^^'^'P.^ik) {k'^D , (4.52b) 
= / ^k'P^k) i^kK) , (4.52c) 
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4.4 Worked out examples 



We will now turn to some explicit calculations of the backreaction, which will show that the 
magnitude of the effect remains negligibly small for most of the evolution duration in which linear 
PT is valid. At early times, linear PT is valid at practically all scales including the smallest scales 
at which we wish to apply general relativity. As matter fluctuations grow, the small length scales 
progressively approach nonlinearity, and linear PT breaks down at these scales. As we will see, 
however, by the time a particular length scale becomes nonlinear, its contribution to the amplitude 
of the metric fluctuations correspondingly becomes negligible. In practice therefore, one can extend 
the linear calculation well into the matter dominated era, with the expectation that the order of 
magnitude of the various integrals will not change significantly due to nonlinear effects. 

The model we will use is the standard Cold Dark Matter (sCDM) model consisting of radiation 
and CDM [12j. We will neglect the contribution of baryons, and at the end we shall discuss the 
effects this may have on the final results. We shall also discuss, without explicit calculation, the 
effects which the introduction of a cosmological constant is likely to have. In the following, fi^ 
and denote the density parameters of radiation and CDM respectively at the present epoch tq, 
with T denoting cosmic time. 0^. is assumed to contain contributions from photons and 3 species 
of massless, out-of-equilibrium neutrinos. At the "zeroth iteration" we have 



1 da\^ 



(4.54) 



where Hq is the standard Hubble constant, the scale factor is normalised so that a(ro) = 1, and 7i 
and H are related by 

rL{a) = aH{a) . (4.55) 

The comoving wavenumber corresponding to the scale which enters at the matter radiation equality 
epoch, is given by 

/2q2 \ 

keq = a,qH{a,,) = HJ-^\ ^ Hq ■ W''/\ (4.56) 
where we have set (see Refs. [JH [12] for details) 

fij. — ^photon ~l~ '^^neutrino 



^photon (-'■ + •^■8(^1] / 



= 4.15 X IQ-^'h-^ , (4.57) 

where h is the dimensionless Hubble parameter defined by Hq = lOO/i km/s/Mpc. For all calcula- 
tions we shall set /i = 0.72 [75l. 
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4.4.1 EdS background and non-evolving potentials 

Before dealing with the fuh model (which requires a numerical evolution) let us consider the simpler 
situation, described by an Einstein-deSitter (EdS) background, with negligible radiation and a 
nonevolving potential 99 = ip{x) (which is a consistent solution of the Einstein equations in the 
sCDM model at least at subhorizon scales at late times [l2j). Although not fully accurate, this 
example requires some very simple integrals and will help to give us a feel for the structure and 
magnitude of the backreaction. 

With a constant potential, the only correlation object which survives is S^^\ which evolves like 
~ a^^, where the scale factor refers to the "zeroth iteration". The constant of proportionality can 
be written in terms of the BBKS transfer function TBBKsik/keq) [ZS1II2], to give 

S^'^ = -^l ^^k'PAk)TlBKs{k/keq) , (4.58) 



where we have |76j 

-0.25 



ln[l+0.mx] 
= (0.171X) 



1 + 0.284X + (l.lSa;)^ + (0.399x)3 + (0.490x)' 



(4.59) 



where x = (k/keq). 

The integral in Eqn. (j4.58p is well-behaved even in the presence of power at arbitrarily large 
scales, for a (nearly) scale invariant spectrum. Since we are only looking for an estimate, we shall 
evaluate the integral in the absence of a large scale cutoff, and leave a more accurate calculation 
for the next subsection. For the initial spectrum given by 

^^M^ = A{k/Hor-\ (4.60) 

where the scalar spectral index Ug is close to unity, the integral in Eqn. (I4.58P can be easily 
performed numerically and has the order of magnitude 

/dk 
— k^P^,{k)Tlj,j,si.k/keq) ~ A ik,g f ~ AHl . 10^ , (4.61) 

upto a numerical prefactor of order 1. Since the amplitude of the power spectrum is A 10^^ [77j , 
the overall contribution of the backreaction is 

5(1) 1 

TFT ~ -72 (10"')- (4-62) 

Now, as long as the correlation objects give a negligible backreaction to the usual background 
quantities, when we proceed with the next iteration, the effect of the backreaction on the evolution 
of the perturbations will also remain negligible (at least at the leading order). Hence in practice 
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there will be essentially no difference between the zeroth iteration and first iteration perturbation 
functions. This amounts to saying that when the backreaction is negligible, convergence to the 
"true" solution for the scale factor at the leading order, is essentially achieved in a single calculation. 
We will discuss the issue of convergence in somewhat more detail at the end of the chapter. 



4.4.2 Radiation and CDM without baryons 

Let us now turn to the full sCDM model (without baryons). An analytical discussion of this model 
in various regions of {k, r/)-space, can be found e.g. in Ref. [12]. Since we are interested in integrals 
over k across a range of epochs rj, it is most convenient to solve this model numerically. It is further 
convenient to use (In a) in place of rj, as the variable with which to advance the solution. Also, 
it is useful to introduce transfer functions like ^kiv) for all the relevant perturbation functions in 
exactly the same manner (see Eqn. (|4.48p ). namely by pulling out a factor of tp^., since the initial 
conditions are completely specified by the initial metric perturbation. For a generic perturbation 
function s^{r]) (other than the metric fluctuation (^^) the transfer function corresponding to s will 
be denoted by a caret, so that 

= 'Pkz^kir]) ■ (4.63) 

The relevant Einstein equations can be brought to the following closed set of first order ordinary 
differential equations (adapted from Eqns. (7.11)-(7.15) of Ref. |12j). 



d{lna) 

dSk 
d{lna) 

d&ok 
d{lna) 

dQik 
9(lna) 

dVk 
9(ln a) 



\ 1 / . 4 ~ 

l + 3^r^+2^r-^^ + a^^®°^ 



-— Vfc + 3—- 7 

E d[\na) 



' E^^""^ d{\na) 



Je (®°^ + 
K 

-Vk + —^k ■ 



(4.64a) 
(4.64b) 
(4.64c) 
(4.64d) 
(4.64e) 



Here we have introduced the dimensionless variables 



i^^ — ; E{a) 
-Wo 



n{a) _ nja) 

Ho 



(4.65) 



and the various perturbation functions are defined as follows : 6k is the fc-space density contrast of 
CDM, 0ofc and Qik are the monopole and dipole moments respectively of the A:-space temperature 
fluctuation of radiation, and {—iVk) is the /c-space peculiar velocity scalar potential of CDM (i.e., 
the real space peculiar velocity is va = Oav where v is the Fourier transform of {—iVk))- 

Assuming adiabatic perturbations, the initial conditions satisfied by the transfer functions at 
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a = tti are (adapted from Ch. 6 of Ref. [12] ) 



^k{ai) = l ; 4(«i) = -^ ; ©ofc(aj) = -^ ; Vk{ai) = SQikitti) = I- (4.66) 

2 2 2 r!j[ai) 

We choose Oj = 10~^^, which corresponds to an initial background radiation temperature of T ~ 
lO^GeV. While this is not as far back in the past as the energy scale of inflation (which is closer 
to ~ IQ-'^^Gev), it is on the edge of the energy scale where known physics begins [70J. This makes 
Eqn. (j4.57p unrealistic since we have ignored all of Big Bang nucleosynthesis and also the fact 
that neutrinos were in equilibrium with other species at temperatures higher than about IMev. 
However the modifications due to these additional details are not expected to drastically change 
the final results, and these assumptions lead to some simplifications in the code used. The goal 
here is only to demonstrate an application of the formalism; more realistic calculations accounting 
for the effects of baryons can also be performed (see, e.g. Behrend et al. [l9] who incorporate these 
effects for the post-recombination era, albeit in the Buchert formalism). 

In order to partially account for the fact that inflationary initial conditions are actually set much 
earlier than a = 10~^^, we impose an absolute small wavelength cutoff at the scale which enters 
the horizon at the initial epoch which we have chosen. In the above notation this corresponds to 
setting Kjnax = E{ai) ~ 10^^. This makes sense since scales satisfying K » Kmax have already 
entered the horizon and decayed considerably by the epoch a = 10^^^. There is a source of error 
due to ignoring scales K > Kmax which have not yet decayed significantly, but this error rapidly 
decreases with time as progressively larger length scales enter the horizon and decay. [In fact, in 
practice to compute the integrals at any given epoch a = a,,, one only needs to have followed the 
evolution of modes with < ~ 5000£'(a*) : more on this in the next subsection.] More important 
is the cutoff at long wavelengths, which we set at Kmin = 1 (corresponding to kmin = -f^o)) which 
is firstly a natural choice given that H^^ is the only large scale in the system, and is secondly also 
guided by analyses of CMB data which have detected such a cutoff [73] . We will see that reducing 
Kmin even by a few orders of magnitude, does not affect the final qualitative results significantly. 



Numerical Results 



Equations ()4.64p with initial conditions ()4.66p were solved with a standard 4th order Runge-Kutta 
integrator with adaptive stepsize control (based on the algorithm given in Ref. [ZS])- For the 
integrals in Eqns. (j4.52p . only the function $A:(a) needs to be tracked accurately. Hence, although 
@Qk ^-iid Qik are difficult to follow accurately beyond the matter radiation equality Ogg = (fir/^m) — 
8 X 10^^ due to rapid oscillations, the integrals can still be reliably computed since Qok and Qik do 
not significantly affect the evolution of in the matter dominated era (as seen in Eqn. (j4.64ap ). 



To see that known results are being reproduced by the code, consider Figs. 4.1(a) and 4.1(b) 



as 



examples. Fig. 4.1(a) shows the evolution of two scales corresponding to K = 1 {k = HoMpc 
and K = 0.01. The first enters the horizon at the present epoch, while the second remains super- 
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(a) The evolution of two large scale modes. Also 
shown is the Kodama-Sasaki analytical solution in 
the large scale limit h] ^ 1, given by Eqn. (|4.67p . 
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k(li Mpc"') 

(b) The transfer function normalised by its con- 
stant value at large scales, at the epoch a = 500aeq. 
The dotted line is the BBKS transfer fimction (|4?59ll . 



Figure 4.1: Numerical results for the transfer function. 



horizon for the entire evolution, satisfying krj <^ 1. In this limit an analytical solution exists in the 
sCDM model, due to Kodama and Sasaki [79l[12], given by 



1 



16^1 + y + 9y^ + 2y^ - 8y - 16 



(4.67) 



where y = a/aeq, and this function is also shown. Clearly all the curves in Fig. 4.1(a) are practically 
identical. Fig. 4.1(b) shows the function <I>fc normalised by its (constant) value at large scales, at 
the epoch a = SOOagg ~ 0.04, (which is well into the matter dominated era). The dotted line is the 
BBKS fitting form given in Eqn. (j4.59p with k^q given by Eqn. (j4.56p . 

To numerically estimate the integrals in Eqns. (j4.52p . the values of and its first and second 
derivatives with respect to (In a) are needed across a range of K values. For reference, note that 
the following relations hold for a generic function of time w{rj), 



dw ii'^^ 
drj da 



n 



dw 



d(lna) 



(4.68) 



Based on the earlier discussion, the initial power spectrum P^i{k) is taken to satisfy 

A , for ifo < A; < kmax = W(ai) , 



k'^ Pipi (A^) 



27r2 



and zero otherwise, and we set 



^ = 1.0 X 10"^, 



(4.69) 



(4.70) 



which, for the sCDM model follows from the convention (see Eqn. (6. 100) of Ref. [12]) A = (55^^/3)^ 
with 5h ~ 2 X 10-5 (see, e.g. Ref. [77]). 
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(a) The dimensionless integrand of S^^\ namely the 
function {k/ Hq)^^^, at three sample values of the 
scale factor. The function dies down rapidly for 
large fc, with the value at some k being progressively 
smaller with increasing scale factor. The declining 



behaviour of the curves for a 
extrapolates to large k. 



Qeo and a = 200ae 
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(b) The dimensio nless C DM density contrast. To- 
gether with Fig. 4.2(a) this shows that nonlinear 
scales do not impact the backreaction integrals sig- 
nificantly. 



Figure 4.2: Backreaction and nonlinearity. 



Consider Figs. 4.2(a) and 4.2(b) , which highlight two issues discussed earlier. Fig. 4.2(a) shows 
the integrand of S^^^ at three sample epochs, and we see that the integrand dies down rapidly at 
increasingly smaller k values for progressively later epochs. (The other integrands, not displayed 
here, also show this rapid decline for large k.) [We have not shown the integrand at the later two 
epochs for all values of k since this was computationally expensive, but the declining trend of the 
curves can be extrapolated to large k, which is well understood analytically [12] •] This justifies the 
statement in the beginning of this section, that at any epoch a^, it is sufficient to have followed the 



evolution of scales satisfying K < 5000£'(a*) for computing the integrals. Secondly, Fig. 4.2(b) 



shows the behaviour of = j4^/^|5fc| at the same three epochs, and comparing with Fig. 



4.2(a) we see that at any epoch, the region of /c-space where linear PT has broken down, does not 



contribute significantly to the integrals. This is in line with the conjecture in Ref. [80] that the 
effects of the backreaction should remain small since the mass contained in the nonlinear scales is 
subdominant. We will return to this issue in chapter 5. 

Due to the structure of the integrals and the chosen initial power spectrum, it is convenient to 
compute the integrands in Eqns. (j4.52p equally spaced in (Ini^), and then perform the integrals 
using the extended Simpson's rule [75]. If 2^ -|- 1 points are used to evaluate a given integral, 
resulting in a value In say, then the error can be estimated by computing the integral with 2^~^ + 1 
points to get Tn-i, and estimating the relative error as \Ijsi-i/1n\ — 1. With N = 10, the estimated 
errors in all the integrals at all epochs were typically less than 0.1%. A bigger error is incurred in 
computing the integrand itself at any given epoch, leading to estimated errors of order ~ 1% in 
5(1)^ ^{i) and V^^'i +V^'^\ with a lar ger error in S^"^^ as explained below. 
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Figure 4.3: The correlation scalars ( "backreaction" ) for tlie sCDM model, normalised by H^{a). S^^\ V^^^ 
and 5^^^ are negative definite and their magnitudes have been plotted. The vertical line marks the epoch of 
matter radiation equality a = a^q- 

The second derivative 9^<I>fc/9(ln a)^ proves to be difficult to track numerically. At early times, 
when most scales are superhorizon, the Kodama-Sasaki analytical solution (j4.67p is a good approx- 
imation for most values of k. Using this one can see that at early times the value of the derivative 
is numerically very small, and is difficult to reliably estimate due to roundoff errors. For this rea- 
son the integral 5^^^ could not be accurately estimated at early times. However, the structure of 
the integrand of S'^'^^ ()4.52dp shows that the largest contribution comes from large (superhorizon) 
scales (the small scales being subdominant due to the presence of $fc and l/k"^). An analysis using 
the Kodama-Sasaki solution then shows in a fairly straightforward manner that the behaviour of 
the backreaction term is \S^'^'> /H"^] ~ 10"^(a/aeg)(//o/A; 

min) foi' our choiccs of parameters, where 
(a/aeq) <C 1. At intermediate times around a ~ Ogg and later, although it becomes computationally 
expensive to obtain convergent values for the second derivative at all relevant scalejf], moderately 
good accuracy (1-5%) can be achieved. 

The results are shown in Fig. 14.31 iu which the magnitudes of the correlation integrals of Eqn. 
(j4.52p . normalised by the Hubble parameter squared H'^{a) = iJ-L/a)'^ are plotted as a function 
of the scale factor in a log- log plot. The values for ^'•^^ are shown only for epochs later than 
a ~ O.Olaeg ~ 10~^. We see that at all epochs, the correlation terms remain negligible compared to 
the chosen zeroth iteration background. Also, in the radiation dominated epoch all the correlation 
scalars (except 5*^^^ whose evolution couldn't be accurately obtained) track the ~ behaviour of 
the background radiation density (see also Ref. [28] ) . The discussion above shows however that the 

^Convergence was tested by varying a global parameter which dynamically controls the stepsize during evolution 
(by stepsize doubling/halving, see Ref. [78]). The integrals other than 5^^' show convergence at 3 or more significant 
digits for ah epochs, whereas convergence can be obtained for S^'^^ only at epochs sufiiciently close to matter radiation 
equality, and there only for 1-2 significant digits, by setting stringent conditions on stepsize doubling. 




66 



magnitude of 5^^-* is far smaller than the other backreaction functions at early times, for a cutoff 
at kmin = Hq. On the other hand, in the matter dominated epoch 5^^^ dominates the backreaction 
and settles into a curvature-like ~ behaviour (note that in the matter dominated epoch we 
have ~ a~^). As for the signs of the correlations, S'^^\ S'^'^\ and P*-^-* are negative throughout 
the evolution while "P^^^ + V^"^^ is positive throughout. 

Finally, a few comments regarding the effects of ignoring baryons, nonlinear corrections, etc. 
Including baryons in the problem (with a background density parameter of VL^ ~ 0.05) will lead 
to a significant suppression of small scale power (by introducing pressure terms which will tend 
to wipe out inhomogeneities) and also a small suppression of large scale power. This effect causes 
a (downward) change in the late time transfer function of roughly 15-20% jT2], and therefore 
cannot increase the contribution of the backreaction. Quasi-linear corrections can lead to significant 
changes in the transfer function, but do not cause shifts by several orders of magnitude (see Ref. |81J 
and references therein). Hence accounting for changes due to quasi-linear behaviour will also not 
increase the magnitude of the backreaction by a large amount. As for effects from fully nonlinear 
scales, we have seen that these can be expected to remain small (see also chapter 5). 

Adding a cosmological constant (and retaining a flat background geometry) will change the 
qualitative features of the correlation functions by shifting the scale keq (due to a reduced f^m; 
which will also increase the power spectrum amplitude [12], but again not by orders of magnitude). 
Also, the late time behaviour of the correlation scalars will be affected since the potential will 
decay at late times instead of remaining constant (see also below). Regardless, the backreaction is 
expected to remain small even in this case (which is also indicated by the calculations of Behrend 
et al. [l9] in the Buchert framework). 

Chapter summary and discussion: 

This chapter showed how the spatial averaging limit of MG can be adapted to the case of linear 
perturbation theory (PT) in cosmology, to calculate the backreaction in gauge invariant manner. 
In doing so we also saw the significance of volume preserving gauges in defining self-consistent 
averaging operators. The formalism leaves some freedom in the choice of the averaging operator 
(via a choice of a volume preserving gauge), and therefore we cannot claim that our results are 
unique down to all numerical factors. However, since the final explicit form of the late time 
backreaction in Fig. 14.31 matches closely with the (essentially order of magnitude) estimate of Eqn. 
(j4.62p which is actually independent of any details of the averaging procedure, we do expect our 
results to be qualitatively robust. 

Fig. 14.31 also shows that in the absence of a cosmological constant, the backreaction after a 
single iteration of the procedure outlined in section 4.1, tracks the radiation density in the radiation 
dominated era, and essentially behaves like a curvature term in the matter dominated era. Two 
issues arise from this behaviour. Denote the corrections to the Friedmann equation (|A.3ap and the 
acceleration equation ()A.3bp as Cp and Cacc respectively. Then firstly, we find that in the radiation 
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dominated era, although both Cp and Cacc behave like ~ a~^, their numerical coefficients do not 
combine so as to preserve the conservation criterion (j3.64p separately. Since the backreaction must 
now necessarily couple to the background radiation density, this points to a very tiny gravitationally 
induced correction in the equation of state for radiation. This effect can be traced back essentially 
to the presence of a small but non-zero correlation 3-form and 4-form, arising from higher order 
perturbative effects. In the matter dominated era, at least in the "zeroth" iteration, this effect seems 
to be highly suppressed since we now have Cp ~ and \Cacc\ \Cf\, which is approximately 
consistent with (j3.64p . 

The second issue concerns what happens at higher iterations, and is important from the point 
of view of obtaining a convergent answer for the backreaction. The basic cycle that one needs 
to keep in mind is that the backreaction affects H^, which affects the equations for the density 
and metric perturbations, which in turn define the backreaction. Consider the situation in the 
matter dominated era, which is easier to handle since firstly only one term S^^^ contributes to the 
backreaction and secondly the linear PT solution has a simple analytic form. The estimate in Eqn. 
(j4.62p shows that most of the contribution comes from (quasi) linear subhorizon scales k ~ keq for 
which the Poisson equation holds, so that if the density contrast behaves like 5k ~ D{a) then the 
metric transfer function behaves like ~ D/a. Standard linear PT [T2] tells us that D{a) is 
the so-called growth function which can be written as D E J da/{aE)^ upto some numerical 
coefficient, where E = H/Hq. An analysis similar to the one leading to Eqn. (|4.62p then shows 
that we should expect Cp ~ a^^{D/a)'^ at late times. For a flat universe without a cosmological 
constant, D{a) = a and we recover the single iteration result that we have been discussing so far. 

The crucial thing to note is that since the backreaction affects only the background equations 
and not the perturbation equations, D{a) is completely determined by the Hubble parameter 
H{a) = HQE{a), so that at any iteration i we will have 

(£(*+i))2 = fi(;+i)a-3 + eS„n«"'P^'Va)' ; D^'\a) ~ E^'^ j da/{aE^'^f , (4.71) 

where we expect e[2rxn ~ 10"'*. This immediately suggests that the limit of this series is the 
solution of the integral equation 

E'^ = Qma'^ + ebkrxna"^(i:?/a)2 ; D{a) E J da/{aEf . (4.72) 

This equation can in principle be solved perturbatively by exploiting the smallness of the parameter 
ebkrxri) and we expect the solution to be close to the "zeroth" iteration answer ~ a. To understand 
why, notice that at the zeroth iteration we found 5(1) to be negative, so that the first iteration 
Hubble parameter E^^^ (a) is effectively that of an open universe with a small negative curvature. 
Standard analysis shows that the growth factor in an open universe is suppressed compared to that 
in a flat matter dominated one, and hence the Hubble parameter at the second iteration 

E(2)(a) will 
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have a slightly smaller contribution from the backreaction than E^^\a). This will correspondingly 
slightly enhance the contribution of the backreaction to E^^\a) over the contribution to E^'^\a), 
and so on until the solution converges. 

This convergent solution will, like the radiation dominated case, mildly violate the conservation 
criterion Eqn. (j4.62p . Further, the analysis above generalises to the case when the cosmological 
constant is nonzero. In this case the late time growth factor is suppressed compared to the EdS 
case even at the zeroth iteration [12J, and the convergent solution will violate the conservation 
criterion by an amount comparable to the backreaction itself. What is important however is that 
in all cases, the backreaction as well as the violation of matter conservation remain negligibly small, 
approximately at the level of one part in 10^. This analysis ignored all contributions from scales 
which have become fully nonlinear in the matter density contrast at late times. The reasoning 
was that these scales are not expected to contribute significantly to the backreaction due to a 
suppression in the transfer function In the next chapter, we will confirm this expectation in 
the context of a toy model of nonlinear structure formation. 
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Chapter 5 

Nonlinear structure formation and 
backreaction 



We have seen so far that applying the averaging framework during epochs when hnear perturbation 
theory (PT) is expected to be vahd, leads to only negligible modifications in the standard cosmo- 
logical equations. This is not unexpected, since the estimate in e.g. Eqn. (j4.62p does not depend 
crucially on the details of the averaging procedure, and is therefore robust. Most of the interest 
in the backreaction issue however, has been from the point of view of late time cosmology when 
matter fluctuations at least have become nonlinear on small scales. It has been claimed using some 
simple models of structure formation [37] that using the perturbed FLRW ansatz for the metric 
is no longer a good approximation in this regime, and that one should expect backreaction effects 
to grow large at these times. We will investigate this issue in this chapter, in the framework of 
an exact toy model of structure formation based on the LTB solutions of general relativity (see 
Appendix B). 

We have already seen in our linear PT calculation that if the metric at late times continues 
to be of the perturbed FLRW form (j4.6p . then backreaction effects of the nonlinear scales are in 
fact likely to be small, which follows from studying the structure of the integrands of the various 



backreaction functionals (see the discussion of Figs. 4.2(a) and 4.2(b)). Order of magnitude ar- 
guments such as those in Ref. [27] suggest that the perturbed FLRW metric does remain a good 
approximation at late times. In this chapter we will see an explicit demonstration that a fully 
relativistic, highly nonlinear collapsing system can be described by a perturbed FLRW metric, 
provided peculiar velocities of the matter remain nonrelativistic. Further, in our model (which will 
satisfy this condition) we can then use the formalism developed in chapter 4 to explicitly compute 
the backreaction. We will see that the backreaction in this case does remain small. 
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5.1 Spherical Collapse : Setting up the model 



Our model is based on the LTB solution described in Appendix B, and is completely determined 
once the initial conditions are specified. 

5.1.1 Initial conditions 

While choosing the initial density, velocity and coordinate scaling profiles, we make the important 
assumption that at initial time, a well-defined global background FLRW solution can be identified, 
with scale factor a{t), Hubble parameter H{t) and density Pb{t). This is reasonable since the CMB 
data (combined with the Copernican principle) assure us that inhomogeneities at the last scattering 
epoch were at the level of 10 parts per million. This assumption plays a crucial role in deciding 
which regions are overdense and will eventually collapse, and which regions will keep expanding. 

• Initial density profile p{ti,r) : 
The initial density is chosen to be 

(1 + (5*), r < 

p{ti, r) = pbi < (1 - ^v), n <r <ry (5.1) 
,1, r>ry, 

where p^i = pb{U)- Initially, the region r < r* is assumed to contain a tiny overdensity and 
the region r* < r < r^, an underdensity. In other words, 

0<S^,6y^l. (5.2) 

The discontinuities in the initial density profile can be smoothed out by replacing the step 
functions appropriately. We will not do this here, since the step functions make calculations 
very simple. This is not expected to affect the qualitative features of our final results. 

• Initial conditions on scaling and velocities : 

We match the initial velocity and coordinate scaling to the global background solution, by 
requiring 

R{U,r) = air, (5.3) 
R{ti,r) = GiHir , (5.4) 

with ai and Hi denoting the initial values of the scale factor and Hubble parameter respectively 
of the global background. This amounts to setting the initial velocities to match the Hubble 
flow, ignoring initial peculiar velocities. This is only a convenient choice and the introduction 
of initial peculiar velocities is not expected to modify our final results qualitatively. 
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For the FLRW background we consider an Einstein-deSitter (EdS) solution with scale factor and 
Hubble parameter given by 



a{t) = {t/tof'^ ■ to = 2/{3Ho), 
H{t) = a/a = 2/(3t), 

with tQ denoting the present epoch. Oj fixes the initial time as 



U = 2/{3Ho)a: 



3/2 



(5.5) 
(5.6) 



(5.7) 



We will always use Oj = 10~^, so that the initial conditions are being set around the CMB last 
scattering epoch; in general Oj must be treated as one of the parameters in the problem. The initial 
EdS background density is given in terms of Hq and Oj as 



Pbi 



SttG 



:ma 



(5.8) 



5.1.2 Mass function Af(r) and curvature function k{r) 

We now have enough information to fix M{r) and k{r). Using Eqn. ()B.2bp at initial time together 
with the scaling in Eqn. (15.30 gives us 

GM{r 



1 



l + (5*, < r < r* 
1 + 5, ({rjrf - l) 
^ l + {5Jr^){rl-rl) 



< r < 
r > r„ , 



(5.9) 



where we have defined a "critical" radius Tc by the equation 



1 + 



(5.10) 



The significance of Tc will become apparent shortly. Using the initial conditions Eqns. (15. Sp and 
(j5.4p in the evolution equation ()B.2aP at initial time, gives 



k{r)r 



2 _ 2GM{r) 



air 



2tt2 2 



(5.11) 



with Hf = H^a^ ^, and hence 



tt2 

k{r) = { 
ai 



(5*, r < 
\3 



(5.12) 



72 



The significance of Tc is now clarified. Since 5*, > 0, we have Tc > by definition (Eqn. (jS.lOp ). 
The following possibilities arise : 

• If > r^, then k(r) > for all r, and every shell will ultimately collapse, including the 
"void" region < r < r^. 

• If Tc < r„, then k{r) > for r < Vc and changes sign at r = r^. Hence, the region r^, < r < rc 
will collapse even though it is underdense, while the region r > Vc will expand forever. 

• If Tc = r„ , then the "void" exactly compensates for the overdensity, and the universe is exactly 
EdS for r > r„. [GM{r) = {l/2)H^r^ and k{r) = 0.] Also the "void" will eventually collapse. 

Clearly the most interesting case for us is the one with Tc < r^,, and we will hence make this choice 
for our model. We realize that the model as it stands is not a very realistic depiction of the (nearly 
spherical) voids we see in our Universe [56], since these voids are seen to be surrounded by "walls" 
of matter. However, our goal is to describe two regions, one of which collapses while the other 
expands ever more rapidly, and our model is capable of doing so while retaining its fully relativistic 



Although we have set up the model for all values of the radial coordinate r, hereon we will 
concentrate on the region < r < One reason is that most of the interesting dynamics takes 
place in this region. Another is that the region r > r„ develops shell-crossing singularities due to 
the sharp rise in density across r = r„. A more realistic model would be able to incorporate the 
pressures that are expected to build up when a shell-crossing occurs [82], but the LTB model is 
limited in this respect due to its pressureless character. We will therefore ignore the region r > r^. 

5.1.3 The solution in the region < r < Vy 

The region of interest can be split into three parts : region 1 = {0 < r < r^,}, region 2 = {r* < r < rd 
and region 3 = {rc < r < r^}. The solution in the three regions is as follows : 

• region 1 (0 < r < r*) : 



character. 





(5.13b) 



(5.13a) 



1 — cos Ui = 



1 + 6,' 



(5.13c) 




(5.13d) 
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For regions 2 and 3, it is convenient to define a function e{r) as 



• region 2 (r* < r < rc) : 

1 /fflj 



R=- (Zljr(l + e)(l-cosa), 
2 V £^ 



a — sm a 



2Ho ( e 



1 — cos Qi(r) 



1 + e \ai 
2e 



3/2 



(t - ti) + (oi - sinaj) , 



1 + e 



re 



,3/2 



e(l + e) I (1 — cos aY 
^e^l"^ ( sina 

+ 



-ffi(i — fj) sin a 



1 + eY V sina 



3 + e 
1 + e 



(5.14) 



(5.15a) 
(5.15b) 
(5.15c) 



(5.15d) 



region 3 {r^ < r < r^^ : 
\ ( a 



R 



2 \ e 



r(l + e)(cosh rj — 1) 



sinh r] — r] 



2Ho f\e 



cosh.rii{r) — 1 



R^R! =^[l 



1 + e \ Oj 
2|e| 



3/2 



{t - ti) + (sinhr/i - r/j) 



1 + e 



re 



1 



,|3/2 



e(l + e) I (cosfi?? — 1)2 

4|e|^/2 / sinliT/ 
(1 + e)2 \sinli7/j 



Hi{t — ti) sinlir; 



3 + e 
1 + e 



+ 



(5.16a) 

(5.16b) 
(5.16c) 



(5.16d) 



Tlie crossover from region 1 to region 2 is discontinuous in R' (but not in R) due to our discontinuous 
choice of initial density. Smoothing out the density will also smooth out R' . The crossover from 
region 2 to region 3 can be shown to be smooth, by considering the limits r — > r~ and r ^ r+ 
or equivalently e ^ 0" and e ^ 0+. Note that the results in Eqns. ()5.13p . ()5.15p and (I5.16|) are 
exact, and do not involve any perturbative expansions in 5^ or even though these parameters 
are small. 
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Parameter name 


Parameter value 


ai 


0.001 


Ho 


l/13.59Gyr-^ (= 72 km/s/Mpc) 


to 


2/(3ifo) = 9.06Gyr 


c 


306.6 MpcGyr"^ 


(5* 


1.25ai(37r/4)^/^ = 2.21 x 10"^ 




0.005 




0.004c/i7o = 16.7 Mpc 


tturn / to 


0.72 


rc 


n (1 = 18.8 Mpc 




1.25rc = 23.5 Mpc 


R{to,r*) 


6.8 Mpc 


Rito,rv) 


33.3 Mpc 



Table 5.1: Values of various parameters used in generating plots. 



5.1.4 Behaviour of the model 

Each shell in the inner, homogeneous and overdense region 1 behaves as a closed FLRW universe, 
expanding out to a maximum radius Rmaxir) given by 

RmaAr) = ^r{l + 5,) . (5.17) 

All the inner shells reach their maximum radius and turn around at the same time ttum given by 

Wn = t. + ^(g) (vr-(n.-sinn,))..to(^)(g) , (5.18) 

where we have used the smallness of a, and 5^ to make the last approximation. By appropriately 
choosing a value of 5^, , we can arrange for the turnaround of region 1 to occur either before or after 
the present epoch. 

In Table 15.11 we have listed the parameter values which we will use frequently in displaying 
plots. Along with the parameter set {a^, Ho,6^,6y,r^,rv}, we have also listed the values of the 
derived quantities {rc,ti,to,tturn} and speed of light c in units of MpcGyr"^. We have also shown 
the values of the present day physical area radius R{to,r) at r = and r = r^. The density 
contrasts are to be understood to reflect the inhomogeneities in the dark matter density close to 
last scattering, and not the inhomogeneities of the baryons which were much smaller |12j . In Fig. 
15.11 we have shown the evolution of the density contrast S{t,r) defined in the usual way by 

l + S{t,r) = ^^, (5.19) 



75 



14 r 








10 



12 



8 



-0.1 
-0.2 
-0.3 



5 



6 -0.4 




6 



-0.5 
-0.6 
-0.7 
-0.8 



4 



2 







0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 



(a) 



(b) 



Figure 5.1: The evolution of tlie density contrast d{t,r), using parameter values from Table EH] evaluated 
at (a) r — r*/2 in region 1 and (b) r ^ (tc + rv)/2 in region 3. 

for the parameter choices of Table \57L\ for which one has ttum/to — 0.72, so that the collapse is well 
under way in region 1 at the present epoch. The two panels show the contrast for two representative 
values of r, one in region 1 and the other in region 3. 

5.1.5 Aside : Acceleration from initial conditions 

It is interesting to note that our model is capable of qualitatively reproducing results derived by 
earlier by Rasanen [S^ in the context of a very simple model of structure formation. Rasanen's 
model can be summarized as follows : one considers two disjoint regions, one overdense and the other 
completely empty, each evolving according to the FLRW evolution equations. (The embedding of 
these regions in an FLRW background, and the behaviour of the region between these two regions, 
is not considered.) The scale factor in the overdense region therefore behaves as oi oc (1 — cosn) 
with t (X {u — sinn), and the scale factor in the empty region behaves as 02 oc t. It is then 
straightforward to show that if one defines a volume averaged scale factor by = of + , then the 
effective deceleration parameter given hy q = —{aa)/a? becomes negative (indicating acceleration) 
around the time that the overdense region turns around and starts collapsing. 
If we define the volume of each of our three comoving regions separately, as 



then the total volume of the region can be used to define a "Buchert-style" volume averaged scale 
factor as 




(5.20) 
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and hence an effective deceleration parameter q given by 



da VV , , 

-^ = 2-3^, (5^22) 



wliereas Rasanen's model can be mimicked more closely by ours, if we simply remove the region 
2, by hand. By doing so we are left with two disjoint regions, each spherically symmetric, one of 
which is collapsing and the other expanding ever rapidly and becoming ever emptier. There is no 
physical reason to throw away region 2 in this manner, but for the sake of comparison we will define 
a "modified" scale factor amod and it's corresponding deceleration parameter Qmod by 

Vi{t)+V3{t) _ dmodamod 



a-niodV') = T/ ^ I TA \ ' Imod = • 

V^i(io) + V3(to)y a^d 

In Fig. I5.2l we plot q{t) and qmod{t), for several sets of initial conditions which are close to our "base 
set" listed in Table ET] (except for Fig. 5.2(d) which has a large value for S^j^. The various initial 



conditions correspond to turnaround times that are slightly greater than, or slightly less than, or 
significantly less than the present epoch. The results are therefore valid regardless of whether the 
collapse has just begun or is well under way at the present epoch. We see that while the modified 
scale factor does accelerate as in Rasanen's model, the scale factor a{t) does not show this effect. 
The reason for this can be understood as follows. The region 2 is of a rather peculiar nature - it 
is underdense initially and becomes emptier with time, however its evolution is closely linked to 
that of the overdense region 1. Namely, the whole of region 2 (except its boundary at r = rc), 
is dragged along with region 1 and eventually turns around, instead of expanding away to infinity 
like its counterpart region 3. Now, if one ignores region 2, then Rasanen's arguments about the 
remaining two regions stand - one region is contracting and the other is expanding faster than the 
global mean, and this stand-off leads to an acceleration of the effective scale factor amod, as we see 
in the plots of Fig. 15. 2i But if we account for region 2 as well, then we bring in a counter-balancing 
influence of a large underdense volume which is expanding slower than average, and this reduces the 
accelerating influence of region 3 to the point of making the effect completely disappear. Note that 
at late times, the volume of region 1 contributes negligibly to the total volume, and the volumes of 
regions 2 and 3 are comparable. 

We wish to highlight two points. First, it is very important to note the role played by the initial 
conditions in this entire excercise. The function k{r) is defined in a continuous fashion once the 
initial density, velocity and coordinate scaling are given, and k(r) then decides which shells will 
eventually collapse and which will not. The continuity of k{r) assures us that in models such as 
ours, with an overdensity surrounded by an underdensity, the underdense region will always contain 
a subregion in which k(r) > 0. We see therefore that the existence of region 2, is a generic feature 



'^Both curves in Fig. |5.2(d)| begin at q ~ 0.5 ai t — ti. To enhance the contrast between the curves, we have 



plotted them for times t > Q.lSttum- The remaining plots (Figs. 5.2(a) ■5.2(c) I are plotted starting from t = U 
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Figure 5.2: The deceleration parameters for a range of parameter values. The dashed lines correspond to 
Qmod and the solid lines to q. The a;-axis shows t/ttum, where Uum is the time at which region 1 turns 
around, and is different for each plot. The values for a,;, Hq and c are the same as those listed in Table lOI 
[Both curves in Fig. 5.2(d) begin at g ~ 0.5 at i = t^.] 



not restricted to our specific choice of discontinuous initial density or vanishing initial peculiar 



velocities. Further, as we see in Fig. 5.2(d) , it is possible to make q deviate even more significantly 
from the EdS value than the ~ 10% effect of the first three figures, but this requires an unnaturally 
high value of 5y > 0.01 (the figure has 5y = 0.05), which contradicts CMB data. Secondly, one may 
argue about the "naturalness" of choosing one set of regions over another set, in order to compute 
volumes. But this itself places the physicality of the acceleration effect into question - if one has 
to judiciously choose a specific set of averaging domains in order to obtain acceleration on average, 
then the effect would appear to be an artifact of this choice rather than something which observers 
would see. 



5.2 Transforming to Perturbed FLRW form 

We now turn to the main calculation of this section. We ask whether the LTB metric ()B.ip for our 
model can be brought to the perturbed FLRW form with scalar perturbations, at any arbitrary 
stage of the collapse. Namely, we want a coordinate transformation (t, r) (r, f) such that the 
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metric in the new coordinates is 



ds^ = -(1 + 2^)dT'^ + a2(r)(l - 2i)) {df^ + r^dn^) , (5.24) 
with at least the conditions 

I 99 |< 1 ; \ip\^l, (5.25) 

being satisfied. We will ignore conditions on the derivatives of ip and if for now (see the end of 
Section 3.2). The scale factor is the EdS solution, with r as the argument. The coordinate f is 
comoving with the (fictitious) background Hubble flow, but not with the matter itself. On physical 
grounds we expect that this transformation should be possible as long as the gravitational field is 
weak and matter velocity is small. We will see below that this is exactly what happens. In the 
new coordinates, all matter shells labelled by f expand with the Hubble flow, with a superimposed 
peculiar velocity. 

Since we want f to be comoving with the background, the natural choice for this coordinate 
would be f ~ R/a, at least at early times. Also, we need to account for the local spatial curvature 
induced by the initial conditions. As an ansatz for the coordinate transformation therefore, we 
consider the equations 

r = ^^{l + at,r)) , (5.26a) 
a{t) 

r = t + e°(i,r), (5.26b) 
where ^{t,r) and C^{t,r) are expected to satisfy 

I e I < 1 ; \fH\^l. (5.27) 

This form of the transformation keeps us close to the standard gauge transformation of cosmological 
perturbation theory, while still accounting for the deviations in the evolution from the background 
FLRW, caused by structure formation. We will show that a self-consistent transformation exists, 
which preserves the conditions (j5.25p and (|5.27|) for most of the evolution. We will use the metric 
transformation rule given by 

and expand to leading order in the small functions ^, C^H, (p, ip and also k{r)r'^ which, as we see 
from Eqn. (j5.12p . remains small in the entire region of interest. The relations in Eqn. (j5.28p 
must be analysed for t he cases [ij] = {{tt),{tr),{rr),{99)}, in each of the three regions. (The 
remaining cases can be shown to lead to trivial or non-independent relations.) The analysis is 
similar to the standard gauge transformation analysis in relativistic perturbation theory [12] . Since 
the calculations involved are straightforward but tedious, we will only present an outline of the 
calculation and highlight certain issues. At the end we will present equations for all three regions 
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and numerically show that the transformation is well-behaved in the regime of interest. 
The case (ij) = (96) is easily analysed and leads to 



i^ = eH + ^, (5.29) 

The cases (ij) = (tr) and {tt) both require | dtr |<C 1 for consistency (since the RHS of Eqn. (j5.28p 
in these cases has no zero order term to balance a large dtr). Note that since t is the proper time 
of each matter shell, the quantity Off is simply the velocity of matter in the (r, f) frame (which is 
comoving with the Hubble flow). In other words, 

is the radial comoving peculiar velocity of the matter shells in the (r, f) frame. We will soon see 
that whereas the quantities ^ and behave roughly as ~ {H^r)^ , the peculiar velocity av behaves 
roughly as ~ {Hqt). We will therefore treat (au)^ as a small quantity of the same order as ^, etc. 
The case {ij) = {tr) then leads to 

i^' = avR!, (5.31) 



the case (ij) = {tt) gives 

and the case {ij) = {rr) givej§ 



^ = -i'' + ]^{av)\ (5.32) 



i' = \ {k{r)r' + {av)') . (5.33) 

The equations (|5.29p . (j5.33p . (|5.3ip . and (|5.32p are valid in the entire range < r < r„, provided 
the peculiar velocity remains small in magnitude. The comoving peculiar velocity is given by 

V = dt , (5.34) 

where we have assumed for consistency that \dt{R/a)\ <C 1 and have dropped the term {R/a)^ 
since it is expected to be of higher order than dt{R/a). (This can be seen from simple dimensional 
considerations - we have dt{R/a) ~ HR/a, and since, from Eqns. (j5.33p and (|5.12p . ^ ~ {HR)'', 
we also have {R/a)^ ~ {HR)^/a.) We will see that these conditions do indeed hold for most of the 
evolution, throughout the region of interest. 



This corrects an error in Eqn. 35 of Paper 4. I am grateful to Karel Van Acoleyen for pointing this out to me. 
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Figure 5.3: The quantity av/{Hor) in region 1, plotted using parameter values from Table [STTl Since 
(ffo?'*/c) ^ 0.001, the peculiar velocity av remains small. 

5.2.1 The transformation in region 1 

Since region 1 corresponds to a homogeneous solution, the integrals in Eqns. ()5.3ip and (15.330 can 
be analytically performed. Since R has the structure R = ryi{u{t)), we have av = r{yi — yiH) and 
Eqn. (j5.31|) then leads to 

^^ = -avR, (5.35) 
after setting an arbitrary function of time to zero, while Eqn. (|5.33p gives 

(5.36) 

after setting another arbitrary function of time to zercE|. The peculiar velocity can be explicitly 
calculated to be 

a{t)v{t,r) = {H,r) 

where the various functions are defined in Eqns. (|5.13p . and we have defined the constant B by 

B ^ ^ f ^y^' - [u, - sinu.) . (5.38) 
1 + 0* \aij 

In the rest of this section we will use the parameter values listed in Table 15. li In Fig. 15.31 we 
have plotted av/{HQr) in region 1. We see that this dimensionless quantity remains of order ~ 1 

^Note that it might be more meaningful to fix the two arbitrary functions of time £^{t, 0) and ^"(f, 0), by requiring 
that ^{t,rc) and ^''^{t,rc) vanish. This would be in line with the shell r — rc expanding like the flat EdS background. 
However, this complicates some of the expressions we evaluate, and does not change the order of magnitude of any of 
the final results. Hence we will continue to assume that the transformation functions ^ and vanish at r = rather 
than at r = rc. 



-Hi + (yi - yiH) 



smti 



cos u 



(1 — costi) 3(u — sinu + B) 



(5.37) 
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throughout the evolution. For our choice of {HQr^:/c) 10~^, which corresponds to an overdensity 
spanning a few Mpc today, the peculiar velocity is of order ~ 10~^ in region 1. 
Direct calculation shows that (p and V' are equal and given by 



It is not hard to see that for our parameter choices, ip and ^p remain of order ~ [H^rY ~ 10^® 
for most of the evolution (the 1/(1 — cosu) factor will start becoming significant only at very late 
times which are larger than to for our parameter choices). The fact that ip = ij: is'va. fact a general 
result which follows from the absence of anisotropic stresses in the problem. It can be shown (see 
e.g. Refs. |35l I12j ) that the difference between ip and ip is governed by the stress-tensor component 
Tqq which vanishes for the spherically symmetric dust we are considering^. This is fortunate, since 
the form of ^ in regions 2 and 3 is complicated, and is cumbersome to evaluate numerically. All 
we need however is {av) which can be directly evaluated, and which can be found after one 
integration in (j5.3ip . These are sufficient to determine ip and the form for immediately follows, 
assuming that ip and ij) vanish at the same radius r (which in our case is r = 0). 

5.2.2 The transformation in regions 2 and 3 

For the calculation in regions 2 and 3, the integrals involved cannot be computed analytically. We 
will therefore display the expressions we obtain for v and and plot the results of numerically 
computing ip = ip from these quantities. 

• region 2 (r* < r < Tc): 
In region 2 we have 



2 4 (l-cosu)2 



(5.39) 



1 — cos u 9 (u — sin u + i?)^ 



v = - [C{t, r) - H] 



(5.40) 




(5.41) 



where C''(t,r^,) is computed from Eqn. (j5.35p at r = r,,,, and we have defined 



C{t,r) 



Hi sin a 

2^3/2 

(1 — cos a)2 1 + e ' 



(5.42) 



ip must now be computed using (|5.32p . 



''This is independent of whether we use {t,r) or {r,f), since the angular coordinates are not affected by this 
transformation. 
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( t/to ) 



Figure 5.4: The peculiar velocity avjc in regions 2 and 3 using parameter values from Table WX 



region 3 (jc < r < r^) : 

The analysis is very similar to that in region 2. We find 



R 



f{t,r)=£,'^{t,rc)+ait) / v{t,r)R'{t,r)dr 



(5.43) 
(5.44) 



where S,^(t,rc) is obtained from (|5.4ip . evaluated in the limit r ^ , and we have defined 

Hisinhi] 2|e|'^/^ 



D{t,r) 



(coshr/ — 1)^ 1 + e 



(5.45) 



In Fig. 15.41 we have plotted the velocity av/c in regions 2 and 3 for a range of time. It can be 
shown that at the order of approximation we are working at, av changes sign at r = rj§. In Fig. 
15.51 we plot if. We see that this function is well behaved and remains small for the entire region of 
interest (in space and time). Hence the perturbed FLRW picture is indeed valid for this system, 
even though each region by itself appears to be very different from FLRW in the synchronous 
coordinates comoving with the matter. Due to numerical difficulties close to the initial time t = tj, 
we have plotted the time axis starting from t = bOti. Note that the magnitude of (p is sensitive 
to the overall size of the region, determined by the value of R{t,ry). For our parameter choices 
given in Table ETTl the size of the region at the present epoch is ~ 33Mpc, which is a typical size 
for observed voids. The dependence is roughly (HR)^, and hence a void which is about 10 times 
larger in length scale than the above value, would have metric functions about 100 times larger. 



^Recall e(rc) = and hence this shell expands exactly like the EdS background. The metric in the (r, f) coordinates 
will not be exactly EdS at r = rc, due to our unusual choice of normalisation for ^ and at r = 0. This does not 
pose any problem for our conclusions. 
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Figure 5.5: The metric function Lp(t, r) in regions 2 and 3 using parameter values from Table [?TT] The time 
axis begins at t = 50ti. 



We end this subsection by noting the following. It is known that simply having a metric of the 
form (j5.24p with only the magnitude of the perturbations being small, is not enough to guarantee 
consistency with Einstein's equations written as a perturbation series; additional constraints on the 
derivatives of these functions must be satisfied. These constraints, given in e.g. Ref. [27J, take the 
form (for the metric (j5.24p with ip = —<p), 

< J^V"(^V„vp , (V^ipV^iff < (v°V^vp) Va^iB^ , (5.46) 

where a,(3 = 1,2,3, and is the spatial covariant derivative associated with the flat 3-space 
metric. On dimensional grounds, treating ip ~ {HRf <C 1, 5t ~ and V ~ aR-\ it is easy to 
see that these constraints will be satisfied by our solution. This should also be expected since we 
started from an exact solution of the Einstein equations and performed a self-consistent coordinate 
transformation . 




5.2.3 The magnitude of the backreaction 

One can now legitimately ask the question, "How large is the effect of the small metric inhomo- 
geneities?" Naively, one would argue that small inhomogeneities must lead to small effects. Indeed, 
the question of the magnitude of the backreaction in the Newtonianly perturbed FLRW setting 
has been investigated by Behrend, et al. [49j in the linear and quasilinear regimes, and they find 
that corrections to the FLRW equations remain at the level of one part in 10^. However, what 
we are dealing with is a situation in which the matter perturbations are completely nonlinear, and 
it is not a priori clear that the same arguments would carry through. Indeed, we saw in section 
2 that the deceleration parameter q deviated from its EdS value by about ~ 10%. Here we give 
an argument based on dimensional considerations supplemented with realistic numbers, which will 
show that this effect is scale dependent, and is not expected to be present if a sufficiently large 
averaging scale is chosen. 
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In the following we will work at the present epoch tQ. Consider a model situation similar to the 
one we have been considering so far, such that at present epoch the physical extent of the overdense 
region is R^, and that of the underdense is Ry. For order of magnitude estimates, we assume that 
in the perturbed FLRW metric (j5.24p (which is valid for this system provided H^Ry <C 1), 93 ~ 
Also assume that the density contrast in the overdense region is 5^,o and that in the underdense 
region is 5^0, where we take and to be constant in space, which is fine for an order of 
magnitude estimate. The backreaction in the Buchert approach contains, among other terms, the 
spatial average of the quantity V^(/? which appears in the spatial curvature |30l I49j . where is 
the Laplacian operator for the flat 3-space metric. The spatial curvature has the structure 

^ ~ ^ [(#i)VV + (#2)'^VV + (#3)(V(^)2] , (5.47) 

where #i,#2 5#3 are constants whose values are irrelevant for this order of magnitude argument. 
Due to the Einstein equations in the small scale Newtonian approximation, the leading order effect 
in the nonlinear regime, comes from V'^ip which satisfies 

2 i H^S^o, overdense region , 
V 93 ~ < (5.4sj 
I Hq5vq , underdense region . 

Consider the situation when, at present epoch, i?^, 6Mpc, Ry ~ 30Mpc, 6^0 10^ and 6^0 ~ —0.9. 
These are typical numbers for clusters of galaxies and voids. It is straightforward to now show that 
the spatial average of V'^cp over a domain comprising the overdense and underdense region, works 
out to be 

~ -O.IH^ . (5.49) 

It would appear therefore, that this spatial average of V'^^p (which is usually neglected) thus turns 
out to be a significant contributor to the backreaction. (In fact it is the most significant contributor, 
since the other terms are clearly of at least one higher order in the small quantity {HqR^)'^, for 
such a model.) 

As we now argue, however, the above effect can be deceptive, and is really scale dependent. Let 
the initial density contrasts in the overdense and underdense regions be and 6yi respectively, 
so that liJuil <C 1. If M^,j, Myj, M^, and My are the masses at initial time and today, in the 
overdense and underdense region respectively, and pi and pQ are the values of the background 
density at initial time and today, then at initial time 

M„ ^ Pi{a^R,f = poRl , Myi^poRl, (5-50) 
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and at present time, 



M, = po{l + 6^)Rl > M^i , M„ = po{l + 6^)Rf, < M, 



(5.51) 



We now make the crucial observation that if the averaging scale is large enough, and we are counting 
several such "pairs" of overdense and underdense regions, then the mass ejected from the underdense 
region must have all gone into the overdense region. It is then easy to show, that 



which means that, just like in the linear theory, the average of V'^ip is expected to be negligible on 
such a scale. In the real universe, we do expect that the averaging scale must be at least of the order 
of the homogeneity scale, and on such a scale we will be sampling several pairs of overdense and 
underdense regions. The only cumulative effects that may arise with such a choice of scale are from 
terms such as (Vip)'^, which as we mentioned earlier, are of one higher order in the perturbation 
and will give effects of the size ~ Hq{HoR^)^ <C Hq. (For a demonstration of the scale dependence 
of the effect, see e.g. the work of Li and Schwarz, the first paper in Ref. [52].) 

5.3 Backreaction during nonlinear growth of structure 

We can do better than the estimates for the magnitude of the backreaction during late stages of 
structure formation. In chapter 4 we have already developed a formalism in place to calculate the 
backreaction whenever the metric has the perturbed FLRW form (irrespective of matter inhomo- 
geneities) . We can use this procedure on our LTB model in the (r, f ) coordinates to explicitly 
evaluate the backreaction functions. 

The expressions for the backreaction in Eqns. (j4.42p were derived under the requirement that the 
averaging operation be free of gauge related ambiguities, in linear perturbation theory. However, 
the actual conditions used to derive Eqns. (j4.42p only depended on the fact that one is working with 
leading order effects in the metric perturbations. In particular, a key step was the transformation 
(j4.39p between the metric in the conformal Newtonian gauge (in Cartesian spatial coordinates) and 
the corresponding volume preserving form. In the present context, the same transformation remains 
valid at the leading order, and hence the expressions ()4.42p for the backreaction are physically 
relevant here as well. We emphasize that this truncated averaging operation remains valid even at 
late times since the weak field approximation for gravity works well during the nonlinear phase of 
structure formation. 

Since our numerical results are in terms of the LTB variables {t, r), where r "comoves" with the 
matter but not with the FLRW background, we need to reexpress the averaging operation (j4.27p 
in terms of these variables. It is easy to show that, at the leading order, the average of a scalar 




'V 1 



(5.52) 
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s{t,r) defined in Eqn. (j4.27p can be written as 



3 



f 

Jo 



■rUt) 




(a(t)L)3 



sR^R'dr , 



(5.53) 



where the function solves the equation 



R{t,rL{t)) = a{t)L. 



(5.54) 



Eqn. (j5.53p gives the average of s over a single domain centered at the origin, which is what we will 
restrict ourselves to in this section. There are two reasons behind this choice : firstly this is the 
most natural choice given the symmetry of the system, and secondly since our model is constructed 
as a "typical representative" of nonlinear inhomogeneities, it makes sense to use averages over the 
single central region as representative of more general averages. As discussed in section 5.1, we are 
constrained to consider values r < due to unphysical shell crossing singularities in the region 
beyond. For this reason the largest value of L which we can choose is L = r^, which then ensures 
^L{t) < fv since rL{t) is a decreasing function for this choice. This gives us an averaging scale of 
L = 23.5 Mpc (comoving with the FLRW background), which is smaller than the more realistic 
expected value of ~ 100/i^^Mpc. One consequence is that our model does not strictly satisfy the 
condition that the potentials ip and ip and their spatial and time derivatives should average to zero, 
as was assumed in chapter 4. One can check that the actual average values of the form ( {Ba^) )^ are 
small (< 10%) compared to terms like ( {dA'f)'^ ) which are needed in the backreaction calculations, 
for all times, although it turns out that the time derivatives satisfy (v')^ ~ ((¥')^)j throughout 
the evolution. However, since the averaging scale chosen here is large enough to encompass all the 
inhomogeneity of this system, we expect that our estimates for the backreaction functions in Eqns. 
(j4.42p are fairly representative. 

Consider now the function 13. Since /? satisfies the Poisson equation = 99 — 3V' on a flat 3- 
space background, with no nontrivial solutions of the corresponding homogeneous equation allowed, 
we can directly write the solution for (3 in terms of the background radial coordinate f as. 



''' Jo Jf 

where q = —2ip (we have set = ^ at leading order) and the integration is over the spatial 
coordinates comoving with the background. The following relations turn out to be useful in the 





(5.55) 



87 



calculations, 



1 r 

dfP = / qy^dy , 
Jo 

2 2 

dfl3 = q drP , 

r 

c^l/? = -^dfl3 - -q + dfq . 



(5.56a) 

(5.56b) 
(5.56c) 



We will need the quantities df(3, dfj3 and 9|/3 as functions of the LTB variables (t, r), which can 
be done by replacing r and f at leading order by t and R{t,r)/a{t) respectively. This gives us 
(treating g as a function of t and r), 



idfP)it,r) 



q R^R'dr , 



{djp){t,r)=q-^ 1^ qR^R'dr., 



6a 



(5.57a) 
(5.57b) 
(5.57c) 



where in the last equation we have used the fact that, at leading order, dfq = {a/R')q'. 

Also, noting that the time derivatives in Eqns. ()4.42p are taken keeping the coordinate r fixed, 
we have at the leading order, and in terms of t ~ r 



idfm,r) 



aR'^ Jo 



q R^R'dr 



1 



qR^R'dr 



aR^ JO 
9 - -^g ^ qR^R'dr, 



(5.58a) 
(5.58b) 
(5.58c) 



which follow from Eqns. (j5.56p . The expressions in Eqns. (j4.42p . rewritten in terms of the LTB 
proper time t and valid at leading order in the various small quantities, reduce to 



2{i^r)-{{d',py)-2{{l/f')idff3) 



5« = -4 



6( {dfipr ) - ( {dl(3f ) - 6( {dfP - f&iaflr^ ) 



•p(i) _^7:.(2) ^ _2i7 



( (dlmVn) ) + 2( (i/r~2)(a,/3)(a,/3) ; 



5(2) = ( (df^ + HdfP)ia^HdfP - dfif) 



(5.59) 



where the angular brackets are now defined by Eqn. (j5.53p and the various integrands can be read 
off using Eqns. (j5.57p . (j5.58p and the results dfif ~ {a/R')ip' and f {R/a), at leading order. Figs. 
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Figure 5.6: The evolution of liS*-^-*!, normalised by 6H^. Also shown is a hypothetical curvature-like correc- 
tion, evolving like ^ a^^. 

15.61 and 15.71 show results of numerical calculations performed with Mathematica. Fig. 15.61 shows the 
evolution of —S^^^ / {6H'^), the dominant correction, as a function of the scale factor. The dotted line 
shows a hypothetical curvature like correction. Clearly the evolution of the actual backreaction is 
more complicated, due to significant evolution of if. Note that the largest value of \S^^^/H'^\ 
computed here is ~ 10~^, whereas estimates using linear theory in chapter 4 suggested that this 
value should be around ~ 10~^. This discrepancy highlights an issue we noticed earlier in chapter 
4, namely that nonlinear inhomogeneities on small scales do not contribute significantly to the 
backreaction. Our model has no large scale inhomogeneities and underestimates the backreaction. 
Reassuringly, accounting for the deficit only requires a calculation in linear theory, such as the one 
in chapter 4. Fig. 15.71 shows the evolution of the remaining integrals, also normalised by 6H^. 
An initial rapid decay of V^^^ /H"^ starting from values of ~ 10~^ has not been shown, in order to 
enhance the contrast in the late time behaviour of the three functions. The other functions remain 
subdominant compared to V^^^ at the early times not shown. 

This completes the picture of the effects of backreaction in the cosmic expansion history. Our 
covariant and self-consistent calculation of the backreaction in this spherical collapse model es- 
tablishes that inhomogeneities have an insignificant impact on the average cosmological dynamics. 
In particular, the observed cosmic acceleration cannot be explained by the averaging of inhomo- 
geneities. Our nonlinear dust model can be regarded as representing a realistic situation, because 
it has a overdensity-void structure, and departure from sphericity, tidal interactions, and second 
order corrections are not expected to introduce any significant change in the results. What appears 
true in general is that as long as peculiar velocities remain small, as seems to be the case in the 
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Figure 5.7: The normalised evolution of the backreaction functions other than iS*-^-*. To enhance contrast, 
a strongly decaying early time mode for V'^^^/H^ has not been shown. 

real universe, a description as a perturbed FLRW model is valid, and this keeps the backreaction 
small. 

Chapter summary and discussion: 

In this chapter we addressed the question of whether or not the perturbed FLRW form for the metric 
remains a valid approximation at late times in the cosmological history. Heuristic arguments such 
as those presented by Ishibashi and Wald [27] indicate that this should in fact be the case. We have 
studied this issue in the context of an exactly solvable, fully relativistic toy model which allows us 
to track the behaviour of both the metric and matter perturbations unambiguously, well into the 
regime where the matter perturbations have become completely nonlinear. 

In these late stages of structure formation, a perturbation theory in the density contrast 5 
is no longer valid. As expected from standard Newtonian analyses however [TTj, we found that 
a perturbative expansion in peculiar velocities remains valid until fairly late times. Our model 
parameters were chosen to reflect typical inhomogeneities on scales of tens of Mpc. While the 
perturbative description of our model eventually breaks down (see e.g. the (1 — cosu)~^ factor 
in the expression (j5.39p for cp), the model itself is not expected to reflect realistic conditions at 
very late times. For example, real clusters of galaxies eventually achieve virial equilibrium due to 
random motions of the galaxies, rather than collapsing to a singularity. Our pressureless model 
cannot accomodate such a behaviour and must be abandoned beyond the point where virialisation 
is expected to occur. Heuristic reasoning further suggests that in the virial phase, the peculiar 
velocity with respect to the Hubble flow will essentially be \av\ ~ HR^ij. where iJvir is the virial 
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radius, and the backreaction which is controlled by (av)'^ should therefore decrease with time in this 
phase. Our model parameters were chosen so that the physical radius of the overdensity at i = to 
is close to typical virial radii for corresponding realistic clusters, and it is therefore meaningful to 
stop the model evolution at t = to- 

Additionally we also computed the backreaction in our model using the formalism developed in 
chapter 4. We emphasize that this formalism is fully covariant, and is guaranteed to yield results 
which are coordinate invariant at the leading order. The metric perturbation function ip does not 
satisfy the equations of linear perturbation theory at late times, and neither is it expected to. This 
is reflected in the complicated behaviour of 5*-^^ seen in Fig. 15. 6i However it is still true that the 
metric can be brought to the perturbed FLRW form, which is all that is required for a reliable 
calculation of the backreaction. And finally, since the nonlinear inhomogeneities do not contribute 
as much to the backreaction as those on scales k ~ kgg, our entire discussion regarding convergence 
of the iterative calculation (see chapter 4 summary and discussion) is expected to remain approx- 
imately valid even after accounting for nonlinearity. This is important since it establishes that 
cosmological perturbation theory is stable against including the effects of the backreaction from 
averaging inhomogeneities. 
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Chapter 6 

Conclusions 



Backreaction as an explanation for the late time cosmic acceleration would have truly been the 
most conservative solution to the dark energy problem. Not only would it have resolved the 
discrepancy between observed data and what is generally considered to be "ordinary physics" , but 
more importantly it would have obviated the need for statements such as "We do not understand 
what 70% of the universe is made of"0. Needless to say, this approach has captured the imagination 
of many cosmologists, and the (possibly incomplete) list of references cited in the Introduction is 
testimony to this fact. Due to the technically challenging nature of the problem however, it is 
very important to proceed systematically and rigorously while determining the size and nature of 
the effects of backreaction. This is especially true since order of magnitude estimates on the one 
hand indicate that the effect can never be large |27j . while simple toy models indicate exactly the 
opposite [37] . It has been our goal in this thesis to provide a reliable and self-consistent calculation 
to estimate the nature of the backreaction. 

We have used a fully covariant averaging formalism, adapted to the specific needs of cosmology, 
and developed it further to allow estimates of the effects of perturbative metric inhomogeneities 
in a gauge invariant manner. We have used this formalism in the linear regime of cosmological 
perturbations and have shown that linear PT is stable against the inclusion of backreaction effects. 
We have also demonstrated using simple but realistic toy models of nonlinear structure formation, 
that (a) a description of the universe in terms of the perturbed FLRW metric is indeed valid at late 
times as order of magnitude estimates suggest, and (b) the backreaction due to these perturbative 
metric inhomogeneities remains small, irrespective of whether the matter inhomogeneities are in 
the linear or nonlinear regime. One might argue that more precise nonlinear calculations, e.g. in 
numerical simulations, might enhance the effect. While this might be true, this enhancement is not 
expected to be by orders of magnitude, since the transfer functions routinely found in nonlinear 
calculations do not deviate drastically enough from the linear theory ones [15j, to lead to an order 

Including dark matter would take this number up to approximately 95%. If we further take into account the fact 
that the only component which we directly measure with great precision is the CMB radiation, then one might say 
that we truly understand only ~ 10~* of the universe! 
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unity backreaction. The cosmological expansion history therefore appears to be largely insensitive 
to the presence of the backreaction terms. 

Of course, this means that backreaction from averaging cannot solve the dark energy problem. 
There have been claims in the literature which suggest otherwise; we will comment on a few of 
these in what follows. Kolb et al. [51j study a model with very large nonlinear inhomogeneities (a 
~ 1 Gpc sized void) to claim that in such a case the perturbed FLRW form for the metric cannot 
be recovered at all times, and that backreaction effects can be large. Our results do not contradict 
this, since we have seen that the quantity HR controls the late time perturbative expansion, and 
a ~ 1 Gpc size inhomogeneity with a density contrast of order unity today will imply a breakdown 
in the perturbation theory. What is important to bear in mind however, is whether such large 
inhomogeneities are generic. If the universe is dominated by Gpc sized voids then our analysis will 
indeed break down; but this does not appear to be the case observationally. 

Wiltshire |38j models the scale dependence of inhomogeneities in a nonperturbative manner 
using Buchert's formalism. The assumption is that the clocks of observers in voids run at very 
different rates from those of observers in the "walls" surrounding voids. The argument then is that 
since the universe today is dominated by voids (of sizes ranging from 30-50/i~^ Mpc [56]), "wall" 
observers such as ourselves are atypical. The difference in clock rates between average observers 
and wall-observers is then fitted to data and can account for several sets of observations. This 
was a rather simplistic picture of Wiltshire's model, which is actually far more involved in its 
construction. However, Wiltshire's basic final result, the clock rate difference mentioned above, 
at least superficially does not agree with our results, since in our toy model we explicitly see that 
clock rate differences for any pair of observers are governed by the metric potential ip which remains 
perturbatively small. It is not clear where this discrepancy arises from, and this matter is further 
complicated by the fact that Wiltshire's model does not have a concrete setup in which to follow 
the evolution of inhomogeneities. 

To date, perhaps the most physically clear attempts to explain e.g. supernova data without 
invoking dark energy, have involved the "non-Copernican" models which place us at a special 
location in the universe [57j. At the risk of some repetition, we will briefly discuss a few issues that 
arise in this context. As we discussed in the Introduction, a lot of attention has been focused on 
studying light propagation in the inhomogeneous spacetime of a void modelled by the LTB metric. 
These calculations involve standard GR, without any averaging and the associated complications. 
As far as supernova data are concerned, the calculations involve light propagation in a known 
metric, and the two functional degrees of freedom in LTB models can be suitably employed to 
construct a void geometry (which will typically span several hundred Mpc) which fits the 
supernova data without any dark energy /cosmological constant. This apparently straightforward 
result leads to a host of questions. Do such "super"-voids exist in the universe? Galaxy surveys 
at least do not show any evidence of this, although some analyses of CMB data, specifically using 
the integrated Sachs- Wolfe effect, have thrown up some interesting results [53]. Even if such voids 
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exist, is there any independent evidence that we reside close to the center of one? (Were we not 
close to the center, the CMB dipole would have been much higher.) Current data is not sufficient 
to answer this question, although future surveys are likely to make some headway in this issue 
[60j . On the theoretical front, how does one analyse the growth of structure in a geometry which 
contains a nonperturbative void? This question has begun to receive some attention only recently 
(see the papers by Zibin, Moss and Scott, and Clifton, Ferreira and Zuntz, in Ref. [57]). The jury 
is therefore still out on whether or not "dark energy" is simply a consequence of our being in a 
special location. The most encouraging aspect of the "void instead of dark energy" scenario is that 
it is amenable to observational testing, which is very likely to occur in the foreseeable future. 

To conclude, backreaction from inhomogeneities cannot solve the dark energy problem. Void- 
like inhomogeneities, while having the potential to solve this problem, await further observational 
evidence. And for now, we still do not understand what (at least) 70% of the universe is made of. 
The future continues to hold significant challenges for cosmology and theoretical physics. 
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Appendix A 

Basics of FLRW cosmology 



The unique line element for a spacetime which admits homogeneous and isotropic spatial sections 
(i.e. 3-surfaces of constant curvature) is given by [3] 

ds'^ = -dr^ + a2(r) —r + r^dO^ + sin^ edcp'^ . (A.l) 

\ 1 — Kr^ J 

Here A; is a parameter related to the spatial curvature of 3-space ((3)71 = ek/a^) and a(r) is the 
single dynamical function - the scale factor - which describes the evolution of the universe. This 
thesis will mostly deal with the case k = corresponding to spatially flat sections. The line 
element is written in coordinates which are comoving with those observers who see a homogeneous 
and isotropic 3-space. The coordinate r is called cosmic time. The subscript will refer to the 
present epoch tq and the scale factor is always normalized so that a(ro) = uq = 1. 

The energy-momentum tensor of matter is taken to describe a perfect fluid, which is homoge- 
neous and isotropic in its rest frame (which therefore coincides with the comoving reference frame 
of Eqn. (jA.ip ). This has the form 

n ^ diag(-p(T),p(T),p(T),p(T)) , (A.2) 

with p{t) the energy density and p{t) the pressure as measured in the fluid rest frame. 

The Einstein equations Eab = SnGTab, with Eat the Einstein tensor constructed using (jA.ip 
and Tab as given above, reduce tcQ 

d \ ^ k SvrG 

- + — = ^P> (A.3a) 
a J 3 

a 47rG , , / . , ^ 

- = ^ P + 3p , A.3b 

a 3 

with the over dot referring to a time derivative ' = dr- We will refer to Eqns. ()A.3aP and ()A.3bp as 



^Ean. (IA.3b|l is actually obtained after substituting (IA.3ap in the Einstein equation for a. 
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the Friedmann equation and the acceleration equation respectively. Together with an equation of 
state p = p{p), these equations can be solved to get an expression for p{a) and a(T). (One could also 
use the Friedmann equation ()A.3aP in conjunction with the continuity equation d{pa^) +pd{a^) = 
which follows from T^^ = 0, to solve for p{a) and a(T).) For example, with the simple assumption 
p = wp for constant w, it can be shown that p oc a^'^'^^^'^\ and further if /c = then 

2 

a(r) oc r='(i+™) ^ w ^ —1 

oc exp(ffr) , w = -1 , (A.4) 



where H is constant. Some commonly occurring values of the equation of state parameter w are 
= (pressureless matter or dust) and u; = 1/3 (radiation). The case w = —1 corresponds to 
the cosmological constant A, which gives rise to a constant energy density p\ = A/(87rG), and 
consequently a value of H = a/A/3 in Eqn. (\AA\i above. 

In general we define H = {a /a), called the Hubble parameter. Its value at the present epoch 
(Hq) is called the Hubble constant, and is usually parametrized as Hq = 100/ikms~^Mpc~^, with 
the current consensus on its numerical value being 0.5 ^ h < 0.8. Define the critical density pc at 
the present epoch as pc = {3HQ)/{8TrG), and also the quantities Q-i = Pio/pc where pio is the value 
of the density of the i^^ matter component, at the present epoch. Here i labels the components 
radiation (R), baryons (b), dark matter (DM) and an additional "dark energy" represented by the 
cosmological constant (A). The total energy density is simply p{a) = ^iPi{o) and we can write 
0(a) = p{a)/pc, with the value of 0(a) at the present epoch being = 0(a = 1) = The 
Friedmann equation (jA.SaP can be written as 

H\a) = Hi [(1 - 0) a-2 + J^^a"^ + (J^^ + ^dm) a'^ + Oa] , (A.5) 

where the cases A: = 0, A;>0,A;<0 correspond respectively to = 1, > 1, O < 1, and in 
general one would have Oa = OA(a). The cosmological redshift z of a source which emits light of 
wavelength Aem at time rem and is observed "here-and-now" at time tq with wavelength Aq, is 

l + 1 . (A.6) 

Using this in ()A.5P gives an expression for H{z). Often it is useful to work in terms of conformal 
time rj defined by 

with the corresponding Hubble parameter 

n^-^-^ = aH{a) . (A.8) 
a adrj 
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Appendix B 

The Lemaitre-Tolman-Bondi solution 



In this appendix we describe the spherically symmetric Lemaitre-Tolman-Bondi (LTB) solution 
of the Einstein equations for matter comprising a pressureless "dust" |61j . For an arbitrary dust 
configuration, the metric can always be expressed in coordinates which are "synchronous" {goo = 
— 1; goA = 0) and "comoving" (world lines of the fluid elements are orthogonal to 3-space) [62] , 
Specifically, the LTB metric is given in the synchronous and comoving gauge, by 

ds'^ = -dt^ + ^ ^Y}o dr'^ + R^{t. (dO^ + sin^ ^dct>^) ■ (B.l) 
1 — k(r)r'^ 

Throughout this appendix, a prime and a dot will refer to partial derivatives with respect to r and 
t respectively. The Einstein equations simplify to 

Surfaces of constant r are 2— spheres having area 47ri?^(t,r). p{t,r) is the energy density of dust, 
while k{r) and M(r) are arbitrary functions that arise on integrating the dynamical equations. 
Solutions can be found for three cases k{r) < 0, k{r) = and k{r) > 0. The solution for k{r) = 
(the marginally bound case) has the particularly simple form 

1 /3 

^(i,0=(^^f^) {t-to{r)f\ foTk{r) = 0. (B.3) 

Here to{r) is another arbitrary function arising from integration. The solution describes an expand- 
ing region, with the initial time tin chosen such that t > tin > foi' ^ f- For the other two 
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cases, the solutions can be written in parametric form 

GMir) GM(r) 

R = — , , , „ (cosh// — 1) ; t — to(r) = -7- (sinhr/ — rj) , < rj < 00 , for k(r) < . 

-/c(r)H (^_k{r)r'^y' 

(B.4a) 

GMir) GMir) 

R= , , , , (1-COS7?) ; t-to(r) = ^-r (r? - sinr/) , < ?? < 27r, for A;(r) > . (B.4b) 

kir)r^ (k{r)r^y'^ 

In the unbound case {k{r) < 0), R{t,r) increases monotonicahy with t, for every shell with label r. 
In the bound case {k{r) > 0), R{t,r) increases to a maximum value Rmaxif) for each shell r and 
then decreases back to in a finite time. 

In all cases, there are two physically different free functions, although three arbitrary functions 
k, M and to appear. One of the three represents the freedom to rescale the coordinate r. We 
use this freedom to se10 R{tin,r) = Rin{r) = r. To completely specify the solution, we specify 
the initial density Pin{f) and the function k[r) (which can be related to the initial velocity profile 
Rin{f) using Eqn. ()B.2ap evaluated at initial time). This specifies M{r) = Air Jq Pin{r)f^df (which 
in the marginally bound case is interpreted as the mass contained in a comoving shell), and to(^) 
can be solved for using Eqns. dES]), (iRia]) or (iRibj) as the case may be, at time t = tin- The 
FLRW solution is a special case and is recovered by setting pin = constant, k = constant. 

B.l Regularity conditions 

It is useful to keep in mind certain regularity conditions when constructing LTB models. In any 
LTB model, the functions M(r) and k{r) are to be specified by initial conditions at t = tin, and 
the choice of scaling R{r,tin) = r fixes tQ{r) as 

t.ir) - 1 S ir) ■ C ir) - fB 51 

where Sinir) = (sinh r/j„(r) - ??i„(r)) and Ci„(r) = cosh r?j„(r)-l for A;(r) < 0; Sinir) = {'qin{r) - sin r/j„(r)) 
and Gin{r) = 1 — cosr]in{r) for k{r) > 0. 

The regularity conditions imposed on this model, and their consequences, are as follows 

• No evolution at the symmetry centre: 

This is required in order to maintain spherical symmetry about the same point at all times, 
and translates as R{t, 0) = for all t. The right hand side of Eqn. ()B.2aP must therefore 
vanish in the limit r — > 0. Since the functions involved are non-negative, we assume that we 
can write 

\kir 0)\ rf" , p > -2 ; M(r ^ 0) ~ r° ; R{t, r ^ 0) ~ r^/(t) , a > /3 > . (B.6) 



^In the main text we also use a slightly modified rescaling when convenient. 
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Consistency requires (3 to be constant, and our scaling choice further requires (3 = 1. We do 
not require the exponents and a to necessarily be integers. 

• No shell- crossing singularities: 

Physically, we demand that an outer shell (labelled by a larger value of r) have a larger area 
radius R than an inner shell, at any time t. Unphysical shell-crossing singularities arise when 
this condition is not met. Mathematically, this requires 



• Regularity of energy density: 

We demand that the energy density r) remain finite and strictly positive for all values of 
r and t. Combining this with Eqns. (lB.2b[) and ()B.7|1 gives (assuming that R' is finite for all 
r and since (3 = 1) 



• No trapped shells: 

In order for an expanding shell to not be trapped initially, it must satisfy the condition 
r > 2GM{r). Near the regular center, this condition is automatically satisfied independent 
of the exact form of M(r), since there M ~ r^. 



R'{t,r)>0 for all r, for alH. 



(B.7) 



lim p{t, r) = finite a — 1 — 2/3 = 0=^> a = 3. 



(B.8) 
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Appendix C 

Cosmology in MG 



In this appendix we give proofs of several results that were used in chapter 2. 

C.l Analysis of D^^^"^ = 

We start with the metric 

(^)ds2 = 5oo(t,x)dt^ + gAB{t,^)dx^dx^ , (C.l) 

on M. and assume that it averages out to the FLRW form (Eqn. (I3.7ip ): 

Goo = (5oo) = -/'(0 ; G'oa = (5oa) = ; Gab = {qab) = a\t)5AB ■ (C.2) 

We will analyze the second relation of Eqn. (j3.54p and show that it leads to the result C/*-^ = 
gi-3 _ Q^] = 0, where $7"^^ refers to the connection 1-forms associated with Gij given by 

Cl\ = dt{\nf)dt ■ n\ = H6idx'', 
Cl\ = ^H5ABdx'' ; n^s = H6idt, (C.3) 

where H = {1/ a){da/ dt) for this section. We have 

dg''^ + Cl^jg^^ + n^^g"^ = . (C.4) 

Consider the three cases (a = 6 = 0), (a = 0, 6 = -B) and [a = A^h = B) in turn. The first case 
gives 

dffO" + 2nOo5°° + = , (C.5) 
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which reduces to 

[at5°° + 2ai(in/)5°°] dt + 

We can conclude that 



5a5°° + 2^HdABg 



OB 



dx^ = . 



^°°(t,x) 



^(x) 



Pit) ' 
aAA;(x) = 2a^H5AB9°^ 



(C.6) 

(C.7a) 
(C.7b) 



where A;(x) is a positive definite function (so that the metric signature is preserved) which arises 
as an integration constant and is constrained by Eqn. (|C.7bp . The second case {a = 0,b = B) leads 
to 



[%o^ + 9iln(a/)5n dt + 
which gives us 



dx-^ = 0. 



-OB 



m^(x) 



^ " -.Bf^ , TJX T,AB 



—djm {yi) + —,H5Ajg TT^^^J 

af 



(C.8) 



(C.9a) 



(C.9b) 



where m^(x) is a 3- vector that arises as a constant of integration like A:(x), and is constrained by 
Eqn. dOObl) . Finally, the last case {a = A,b = B) leads to 



[dtg^^ + 2Hg^^] dt + 
which gives us 



djg^B + -^H {6jm^{^) + <5f m^(x)) 
a J 



dx-^ = . 



^AB 



1 



S^^fx) 



^5js^^(x) + ^^H (5>^(x) + 5f m^(x)) = . 



(C.IO) 



(C.lla) 



(C.llb) 



Here s'^^(x) is another constant of integration, a symmetric 3-tensor. Now, since the left hand 
side of Eqn. (jC.Tbp is independent of time, either the time dependence of the right hand side must 
cancel, or both sides must vanish. For the time dependence to cancel, we need / oc {da/dt) which 
is not expected a priori. Therefore both sides of Eqn. ()C.7bp must vanish, which immediately tells 
us that the vector m-^(x) must vanish, and the function A;(x) must be a constant. 



A;(x) = k = constant ; m^(x) = 



Equations ([aObl) and (IC.llbl) then give us 



(C.12) 



(C.13) 
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with the same constant k as in Eqn. ()C.12p . Finally, putting everything together we find 



gij ^ f.Qij _ (C.14) 



The constant k is not constrained by any of the equations and appears to be a free parameter in 
the theory. The modified Einstein equations ()3.58p show that k can be absorbed into the averaged 
energy momentum tensor. We will for simplicity assume k to be unity thereby obtaining, as required 

U^j =gV -G'^ = 0. (C.15) 

C.2 Analysis of the condition (r;^,) = (FLRW)pa^ 

Here we will assume that the line element on M is in the volume preserving gauge 

(M)ds'^ = --rj^ + hABit,^)dx^dx^ , (C.16) 

so that the averaging is trivial, and the metric and averages out to the FLRW line element on M 
given by 

^^^ds^ = -TTvf^ + a\t}6ABdx^dx^ , (C.17) 
{h){t) 

where we used the condition (l/h) = l/{h) that follows from g^^ = G^^. The conditions {T\ 



be I 



(FLRW)-pa^ then result in the following 



r[Jo : (af(ln^))=a,-(lny(7r)), (C.18a) 

r^oA ■■ {dA{lnVh)) = 0, (C.18b) 
i^AB 

r^o : {^dB{ln^))=0, (C.lSc) 

: {l=ei) = H6i, (c.isd) 

r% ■■ {^Oab) = {h)d^H6AB, (C.lSe) 
ric ■■ ((%^c> = 0. (C.lSf) 

Eqns. (IC.18b|) and (|C.18f|) are consistent with each other since '•^■'r;^^ = (?B(ln\//i), and Eqn. 
(|C.18cP is consistent with the assumption Eqn. (j3.53ap . The trace of Eqn. ()C.18dp gives ( (1/a//i)0 ) 
3H. However we have {l/Vh)Q = NQ = dt{ln y/h) , and combined with Eqn. ()C.18aP this gives 

^d^iHh)) = 3dt{lna) ^{h)=d\ (C.19) 
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where we have set an arbitrary proportionahty constant (representing rescahng of the time coordi- 
nate by a constant) to unity. This estabhshes the last equality in Eqn. (j3.78p . 

Finally, consider the trace (( /i"^^ )/( \/7i0ab ) : using the condition g"^^ = G"^^ , Eqn. 
(|(].18e|) and the trace of Eqn. (|C.18dp . this gives us 

^JJ^iVhQAB ) = J^^i ^®AB )=3H ={—&) = { -^(VhOAB) ) . (C.20) 
On using the condition (IS.SSah this leads to 



which is consistent with the assumption 



uAB \ 



(C.21) 



/ j^AB \ j^AB 
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